Vol. 45, No. 5 September, 1938 


PSYCHOLOGICAL REVIEW PUBLICATIONS 


Psychological Review 


EDITED BY 


HERBERT S. LANGFELD 
PRINCETON UNIVERSITY 





CONTENTS 


William Stern: R. B. MacLeod

The Concept of Ability: H. A. Carr and F. A. Kingsbury

Changing Views of Psychology as Science: Arthur G. Bills

A Criticism of Hull’s Goal Gradient Hypothesis: Jack Buel

Operationism in Psychology: R. H. Waters and L. A. Pennington

Psychological Physiology: Carroll C. Pratt

A Note on Guilford’s Generalized Psychophysical Law: Carl Iver Hovland

The Reliability vs. the Validity of Test Scores: Harvey A. Carr

Dr. Britt on the Concepts of Learning, Retention, and Reproduction: Hulsey Cason





PUBLISHED BI-MONTHLY 
FOR THE AMERICAN PSYCHOLOGICAL ASSOCIATION 


BY THE 
PSYCHOLOGICAL REVIEW COMPANY 
PRINCE AND LEMON STS., LANCASTER, PA.
AND OHIO STATE UNIVERSITY, COLUMBUS, OHIO


Entered as second-class matter July 13, 1897, at the post-office at Lancaster, Pa., under 
Act of Congress of March 3, 1879 



























Vol. 45, No. 5 SEPTEMBER, 1938





THE PSYCHOLOGICAL REVIEW 


WILLIAM STERN (1871-1938) 





The recent death of William Stern¹ partakes both of
tragedy and of triumph. One grows sad at the thought of a 
man who, in the declining years of his life, is expelled because 
of a blind and cruel prejudice from the fatherland to which 
he has contributed forty years of scholarly and scientific 
service; who is forced to attempt a fresh start under the handi- 
cap of strange conditions and a strange language; and who dies 
while the future is still clouded with uncertainty. Yet the 
picture is not altogether depressing, for Stern is to be num- 
bered among the few scientists who have brought to ap- 
proximate completion a plan of work, outlined in advance and 
followed through, step by step, during a life-time of research 
and writing. With the publication in 1935 of his Allgemeine
Psychologie auf personalistischer Grundlage,² the system of
personalistic psychology finally received a formulation which 
will place it among the other great systems of modern psy- 
chology. To have achieved this in spite of the disruption of 
his life must be recorded as a triumph. 

The details of Stern’s academic history need not be 
presented here.³ His name is associated particularly with
Hamburg, where he was one of the founders and guiding 
spirits of the new, post-war university. He had, however, 
spent nineteen productive years at Breslau before the Hamburg
period, and after he joined Professor McDougall’s department
at Duke University he continued to be active both
in teaching and in research. There are probably few, if any,
psychologists who have contributed significantly to as many
fields of psychological investigation. The book which has
received widest notice in the English-speaking world is
undoubtedly his child psychology.⁴ This, however, is only
one of a long list of publications which deal with problems of
childhood and adolescence. One of the most valuable of
these, and a work which is still to be considered a classic in
its field, is the study of the language of children.⁵ His
interest in the psychology of childhood led to the study of
more practical problems connected with education, vocational
guidance and mental measurement; and these, in turn, are
to be linked up with more general works on applied psychology
and the psychology of individual differences. As early as
1900 Stern wrote a book on differential psychology⁶ which
may be considered as the pioneer work in this field, and many
‘mental testers’ have forgotten, or have never known, that
he originated the concept of the Intelligence Quotient. As
editor of the Zeitschrift für angewandte Psychologie and allied
publications, Stern was instrumental in furthering this branch
of psychology in Germany. In the more classical fields of
experimental psychology Stern contributed an important
study of the perception of change,⁷ and this interest was
extended in later investigations of movement and of time.
It is significant that, in spite of the relative paucity of purely
‘experimental’ articles, he never in his work in applied and in
child psychology lost sight of the fundamental experimental
problems.

1 March 27, 1938.

2 Allgemeine Psychologie auf personalistischer Grundlage, The Hague: M. Nijhoff,
1935, pp. xxviii, 831. English translation by H. D. Spoerl, General Psychology from the
Personalistic Standpoint, New York: The Macmillan Company, 1938, pp. xxii, 589.

3 A History of Psychology in Autobiography, edited by Carl Murchison, Vol. I,
Worcester: Clark University Press, 1930, pp. 335-388.

It is not primarily as applied psychologist, as child
psychologist, nor even as experimentalist, however, that Stern
is to be remembered. He was first and foremost a systematic
thinker. For him psychology was simply a road leading
toward an understanding of man, and ultimately toward an
understanding of the universe. The personalistic psychology
is an application within a limited region of a point of view and
a method which are contained in the philosophy of Critical
Personalism. Stern preferred, in fact, to regard his psychology
as a special branch of a more general science of personalistics,
a science which he hoped would one day unite
within a single system the findings of all the sciences of man,
the physiological, the psychological and the cultural-historical.
The scheme of such a science, and its bearing on the fundamental
problems of metaphysics, epistemology and value-theory,
are outlined in a three-volume work, Person und
Sache,⁸ the first volume of which appeared in 1906 and the
last in 1924; and it is here that we find presented most clearly
the guiding ideas which persisted throughout Stern’s life and
gave coherence to all his thought.

4 Psychologie der frühen Kindheit bis zum sechsten Lebensjahre, Leipzig: Quelle u.
Meyer, 1914, pp. 472. 4th ed., 1926. English translation, The Psychology of Early
Childhood, New York: Henry Holt, 1926.

5 With Clara Stern, Monographien über die seelische Entwicklung des Kindes. 1.
Die Kindersprache. Eine psychologische und sprachtheoretische Untersuchung, Leipzig:
Barth, 1907, 3rd enlarged ed., 1922, pp. 434.

6 Supplanted by Die differentielle Psychologie in ihren methodischen Grundlagen,
Leipzig, 1911, 3rd ed., 1921, pp. 545.

7 Psychologie der Veränderungsauffassung, Breslau: Preuss u. Jünger, 1898, pp. 264,
2nd ed., 1906.

The philosophy of critical personalism can be considered 
on its own merits, and need not be discussed here. One 
common confusion, however, which arises from the name, 
should be mentioned. The Person, in its philosophical 
meaning, is not to be identified with personality or indi- 
viduality, nor is Thing to be equated merely to ‘object.’ 
Stern’s system is to be understood in the light of a philosophi- 
cal situation in which the only obvious alternative to a naive 
teleology seemed to be a crude mechanism, an alternative 
which Stern refused to accept. The third alternative, a 
dualism which recognizes one set of laws for things and 
another set of laws for persons, was equally abhorrent to him. 
The personalistic solution involves a transcendence of this 
opposition, and a recognition of mechanical laws as operating 
within the context of immanent purposes. Any totality, 
then, which is segregated and which behaves with reference 
to immanent ends, may be considered a Person. Thus the
term may be applied equally well to a human being, a nation,
a race or even a physiological system. The term Thing,
on the other hand, refers to entities or processes which are
activated by purposes extraneous or superordinate to themselves.
It is perhaps unfortunate that these terms have been
chosen, since personalism suggests almost inevitably
subjectivism and anthropomorphism. As Stern has developed
the view, however, it implies rather a kind of concrete idealism,
which recognizes the existence in nature of a hierarchy of
entities (Persons), the properties and behavior of which are to
be understood not in terms of the properties of their parts,
but in terms of immanent purposes of the entities themselves.
Instead of mechanical laws it gives us personal laws, and
instead of mechanical causation personal causation. As a
system it is thus anti-mechanical and anti-atomistic in the
extreme; beside it the Gestalt theory seems moderate and
almost conservative in comparison.

8 Person und Sache: System des kritischen Personalismus. Vol. I: Ableitung und
Grundlehre des kritischen Personalismus, Leipzig: Barth, 2nd ed., 1923, pp. xx, 434.
Vol. II: Die menschliche Persönlichkeit, Leipzig: Barth, 3rd ed., 1923, pp. xx, 272.
Vol. III: Wertphilosophie, Leipzig: Barth, 1924, pp. xx, 474.

The challenge of a system of thought is measured in terms 
of the unexamined assumptions which it uncovers in our own 
thinking. Stern’s personalistic psychology is in this sense 
challenging. His doctrine of space and time as personal 
dimensions is a case in point. For many years experimen- 
talists in the field of perception endeavored to explain the 
spatial and temporal properties of experience through an 
analytic study of the special senses. If, however, we begin 
with a naive phenomenological description of the world in 
which we live, we find that specific sensory components 
cannot be isolated; and if we refine our observations experi- 
mentally we find that spatiality and temporality cannot be 
reduced to terms of specific senses. Evidently space and 
time are intersensorial in character; they underlie all per- 
ceptual experience. When we remember, in this connection, 
that other types of experience, such as brightness and rough- 
ness, have also proved to be intersensorial, we are led naturally 
to the more fundamental question of the status of the senses in 
the whole perceptual process. Can it be that the special 
modalities of sensation are partial differentiations out of a 
more primitive, more generalized, pre-sensory kind of
experience, possibly out of a primitive affectivity, with different
modalities representing different stages of development?
If so, we must recognize that perception must ultimately be
understood in terms of the person as a whole; that to
concentrate on the special senses, i.e., on the ways in which the
senses differ from one another, is to render a complete under- 
standing of perception impossible. It is a person as a person 
which perceives and thinks and acts, and a psychological ac- 
count of these processes can consequently be given only in 
terms of persons. Such a view as this disposes at once of all 
sensationistic and associationistic theories of perception. Simi- 
lar views can be, and have been, propounded without the as- 
sumptions of a personalistic psychology. The fact remains, 
however, that the personalistic approach may lead to fruitful 
experimental hypotheses. 

Apart from the reformulation of specific problems, how- 
ever, the personalistic way of thinking demands a reconsidera- 
tion of certain much more fundamental questions. What, 
for instance, is a psychological fact? And what constitutes a 
psychological explanation? It is true that the answers to 
such questions implied by a personalistic psychology are no 
longer as disturbing as they might have been a few years ago. 
Some contemporary writers would, in fact, adopt an even 
more radical position than that of Stern. Nevertheless, the 
ghosts of Helmholtz and of James Mill are still walking. 
There are still those who seem to believe that the psycho- 
logical fact par excellence is a microscopic fact, a simple 
sensation if the interest be in the cognitive problem, a simple 
reflex if the interest be in motivation; that the grosser data, 
perceptual objects and purposive actions, are psychologically 
less primary, less real and less valid. According to such a 
view, the process of psychological explanation must be a 
process of reduction, a restatement of the macroscopic and 
logically complex in terms of the microscopic and logically 
simple; or, perhaps more commonly, a restatement of facts of 
experience and behavior in terms of the constructs of a science, 
e.g., physiology, which is considered to be more fundamental. 
Such a view implies, obviously, a doctrine of causality which
assigns special causal significance to microscopic processes,
and discounts, if it does not deny altogether, the causal 
significance of macroscopic processes. 

The effect of such tendencies has been to ‘depersonalize’ 
psychology. Until recently the conventional textbook has 
presented in great detail information about the elementary 
processes of sensation, reaction, and the like, and in a final 
hasty chapter has assembled them together in a composite 
picture of personality. It has been tacitly assumed that the 
living, perceiving, striving human being is something which 
can be explained, but which as such can play no part in the 
explaining process. The reaction against this depersonaliza- 
tion of psychology has already set in. To what extent it 
has been due to the influence of Stern will have to be deter- 
mined by the future historian. It is safe to say, at any rate, 
that in Stern’s personalistic system the logical groundwork of 
a non-atomistic and non-mechanical psychology of personality 
has been laid. As soon as we give up the prejudice in favor 
of microscopic causation, and recognize that macroscopic 
entities and processes have a valid scientific status, and can 
consequently function as such in causal contexts, we find 
ourselves free to consider the person as a psychological datum 
and as a causal agent. 

A psychology which recognizes persons as valid psycho- 
logical facts demands a reconsideration not only of the 
general problem of motivation, but also of the more specialized 
problems of perception, learning and co-ordinate fields. 
As it happens, Stern has been more successful in the latter 
task than he has in the former.⁹ The present writer feels
that in its application to the dynamics of behavior the 
personalistic psychology has been dominated far too much 
by the immanent teleology of the personalistic metaphysics. 
To recognize goal-directedness as psychologically contained 
within the behavior of persons is one thing; to interpret this 
directedness in terms of transcendent goals which are logically
implied by the behavior is an altogether different matter.
The one is psychology, the other metaphysics. It is
unfortunate that Stern’s metaphysical convictions have led in
his psychology to a confusion of psychological content and
logical implication. Yet, paradoxically, it was this very
metaphysics which led him to his fresh and challenging
restatement of the problems of psychology.
R. B. MacLeod
Swarthmore College

9 Allport has made this criticism in his penetrating review of the Allgemeine
Psychologie. Cf. G. W. Allport, The personalistic psychology of William Stern,
Character and Personality, 1937, 5, 231-246.














THE CONCEPT OF ABILITY 


BY H. A. CARR AND F. A. KINGSBURY 
The University of Chicago 


The concept of ability is of popular origin, and it has long 
been used in every day life. This popular concept has lately 
been introduced into psychology by way of applied psy- 
chology. Practically no reference to the concept can be found 
in the standard text books at the beginning of this century. 
At the present time nearly all such texts make some use of 
the concept. In the experimental literature we find many 
excellent studies of various abilities. Mechanical ability, 
musical ability, salesmanship ability, and general intellectual 
ability are examples. According to present trends of development,
the concept promises to become one of the major
categories of psychology. Psychology has long been ac- 
customed to borrow and use popular concepts, and it often 
does this without any attempt to analyze and define them in 
any precise manner, and yet such a definition is essential if 
they are to be adopted as one of the major categories of the 
science. This paper proposes such an analysis with some 
additional suggestions concerning the place of this concept in a 
systematic psychology. 

Kinds of ability—Each ability implies a corresponding 
act, and it is named in terms of that act. There are thus as 
many abilities as there are acts which individuals perform. 
Most people have the ability to hear, to see, to taste, and to 
feel objects, the ability to walk, to run, to jump, and to talk, 
and the ability to think, to make decisions, to love and to 
hate. We are accustomed to speak of golfing ability, musical 
ability, stenographic ability, executive ability, memory ability, 
mechanical ability, and mathematical ability, and this list can 
be indefinitely extended. 

Each ability implies a corresponding act, and its existence 
is inferred from the observation of that act. The ability is
thus something different from the act. The act is something
which the individual does, and it can be observed. The 
ability is something which the individual has or possesses, and 
it is something which can not be observed; its existence is 
inferred from the observed act. The ability is thus a concep- 
tual object. It is an attributive definition of the reactive 
nature of the individual, and we are accustomed to define 
the reactive nature of the individual in part in terms of what 
abilities he possesses. 

The meaning of the concept may be further analyzed by 
comparing individuals in respect to the presence and absence 
of certain abilities. Some people have the ability to move 
their ears and others have not. Some people can speak
French and some cannot do so. Birds have the ability to 
fly, while man does not. Many such illustrations can be 
given. An ability refers to an act whose occurrence is a func- 
tion of the constitution of the organism. An individual’s 
abilities refer to those organic conditions upon which the 
occurrence of the act is contingent, and these organic condi- 
tions may be anatomical or physiological, and innate or 
acquired in respect to origin. In the absence of these condi- 
tions, the performance of the act is impossible. Without 
wings, the organism cannot fly. With the destruction of the 
inner ear, the ability to hear is lost. Ability refers to the 
dependence of the acts upon the nature of the organism, and 
since the constitutional nature of the organism is something
that endures for some time, an act that is performed once can 
generally be repeated on the next day and the day thereafter. 

The concept of ability thus has a threefold reference. 1. 
It refers to the observed act from which its existence is in- 
ferred and in terms of which it is named. 2. It is also a pre- 
dictive concept in that it refers to the possibility of repeating 
the act on the next day and the day thereafter. 3. It is also 
an attributive definition of the reactive nature of the indivi- 
dual in virtue of his possession of those constitutional condi- 
tions upon which the occurrence of the act is contingent and 
which render possible its future performance. 

The question as to what abilities an individual possesses
can hardly be regarded as a problem, for an individual necessarily
possesses the ability to do whatever he is observed to do.
The observed performance of the act is prima facie evidence 
of the existence of the corresponding ability. If this were the 
only question at issue, the concept of ability would have no 
distinctive utility in either popular or scientific thought, and 
perhaps the concept would never have been evolved. 

Amount of ability—A number of individuals may each 
possess the same ability, but possess different amounts thereof. 
Abilities are thus said to vary in amount from individual to 
individual. It is this question of the relative amount of 
ability possessed by different individuals that is of practical 
significance, and in every day life we are frequently making 
judgments of our acquaintances in respect to their abilities 
as golfers, dancers, musicians, salesmen, teachers, and 
executives. It is also this question of amount of ability with 
which psychology is primarily concerned. 

Any judgment or measurement of amount of ability is 
necessarily based upon some quantitative feature of the 
corresponding act. While we sometimes say that one indi- 
vidual has more golfing or musical ability than another, our 
judgments are more often couched in terms of a quantitative 
scale of values as poor, mediocre, average, good and excellent. 
Such evaluative judgments imply a standard of reference. 
Our judgments of amount of ability are based upon the effi- 
ciency of the act in attaining some end result, and in the popu- 
lar use of the concept the end result is one that society regards 
as valuable and important. The nature of this socially ap- 
proved end varies with the act. With many acts the end 
product is quite definite and easily measured. In golf the 
efficiency of the act is arbitrarily measured in terms of the 
number of strokes, and not in terms of the amount of time 
required to go around the course. Sprinting ability is mea- 
sured in terms of speed. Sales ability is usually measured in 
terms of amount of sales. Typing ability is measured in 
terms of speed and accuracy, and to some extent in terms of 
neatness. With some abilities the end product is quite complex
and intangible in character and not easily measured.
Dancing, musical and executive abilities are pertinent
examples. 

Factors influencing efficiency—The amount of ability is 
thus measured by means of efficiency scores. Not all effi- 
ciency scores, however, can be used as measures of ability, 
for an individual’s efficiency is a function of many other 
factors than ability. An individual’s ability can be measured 
only in terms of those efficiency scores which are obtained 
under such experimental conditions that the influence of 
these other factors is excluded. The nature of these factors 
will be briefly illustrated with speed of typing as an example. 

1. Objective conditions.—It is a truism that one can type 
familiar, meaningful material at a faster rate than a similar 
amount of tabular material. Speed also varies with the length 
of the material. Two pages can usually be typed at a faster 
rate than one hundred pages of similar material. One can 
usually type faster from memory than from dictation. A 
quiet environment is usually regarded as conducive to speed 
as compared with a distractive environment. Speed may 
vary with all of the variable objective conditions in which 
the act occurs. 

2. The objective, volitional attitude, and motivation of the 
individual.—In typing an individual may strive to attain the 
maximum of either speed or accuracy. Speed is usually 
sacrificed in the attempt to attain accuracy, and accuracy 
is usually decreased when speed is the prime objective. Speed 
of typing may thus vary with the objective of the individual. 
Within certain limits an individual can voluntarily control 
his level of efficiency. An individual may be typing at an 
habitual rate, and then materially increase this speed by 
attempting to do his best, and again this maximal volitional 
rate may be further increased by subjecting the individual to
the motives of rivalry and monetary rewards. 

3. Fluctuating mental and organic conditions.—It is 
evident that the speed of performance may be materially 
influenced by many of the unusual and temporary mental and 
organic conditions of the subject. The potential influence of 
various emotional disturbances, headaches, fatigue, loss of
sleep, and the effect of drugs may be cited as pertinent
examples. 

4. Chance factors.—Speed is also subject to a wealth of 
chance or uncontrollable factors and these may be both ob- 
jective and intraorganic in character. The amount of in- 
fluence which these factors exert will probably vary with the 
individual and the act in question. They will probably exert 
but little influence upon the typing performance of a trained 
subject as compared with that of a novice. Speed of typing 
will probably be less affected than will such an act of skill as 
golf. It is commonly assumed that maze scores are peculiarly
subject to this group of factors. These factors are invariably
present, and all performance scores are subject to their 
influence. 

5. Constitutional conditions.—Finally speed of typing is a 
function of the constitutional nature of the individual. The 
term constitutional refers to all those relatively permanent 
and enduring organic conditions that characterize each 
individual and differentiate him from his fellows. It is the 
enduring character of these constitutional conditions that 
differentiates them from those highly fluctuating and tem- 
porary conditions listed in class 3 above. It may be noted 
that these constitutional conditions may be either innate or 
acquired in respect to origin. Speed of typing is, to a large 
extent, acquired, i.e., it is a function of those organic modifications
resulting from previous practice. On the other hand,
speed of running for the longer distances is to a considerable 
extent a function of the stamina and endurance of the indi- 
vidual, and these characteristics are generally regarded as 
partly innate. 

Measurement of ability—The amount of ability which a 
given individual possesses can be defined only in terms of 
more or less than that possessed by other individuals. 
Ability, when subjected to measurement, is a relative term. 
The amount of a given individual’s ability must be quantita- 
tively defined in terms of a scale of values that denotes his 
position within some standard group of individuals. 
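The idea of defining an amount of ability by position within a standard group can be illustrated with a small modern sketch. It is not part of the authors' text: the function name, the choice of percentile rank and standard score as the "scale of values", and the sample scores are all assumptions made for illustration only.

```python
from statistics import mean, pstdev


def relative_standing(score, group_scores):
    """Return (percentile_rank, z_score) of `score` within `group_scores`.

    Hypothetical helper: percentile rank and standard score are only two
    of many possible scales for expressing position within a group.
    """
    if not group_scores:
        raise ValueError("standard group must not be empty")
    below = sum(1 for s in group_scores if s < score)
    ties = sum(1 for s in group_scores if s == score)
    # Mid-rank convention: tied scores count as half below, half above.
    percentile_rank = 100.0 * (below + 0.5 * ties) / len(group_scores)
    spread = pstdev(group_scores)
    # If every member has the same score, no one stands above or below.
    z_score = (score - mean(group_scores)) / spread if spread > 0 else 0.0
    return percentile_rank, z_score


# Invented typing speeds (words per minute) for a standard group of five.
standard_group = [40, 45, 50, 55, 60]
rank, z = relative_standing(55, standard_group)
print(f"percentile rank = {rank:.1f}, z = {z:.2f}")
```

The same raw score would yield a different standing against a different standard group, which is the sense in which measured ability is a relative term.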

The relative amounts of ability possessed by the members
of a group of individuals are indirectly measured by means of
that distribution of efficiency scores that is known to be a 
function of the fifth group of factors—the constitutional 
differences of these individuals. Individuals differ in amount 
of ability just in so far as their efficiency of performance is a 
function of their constitutional differences. As the first step 
in measuring ability, the experimenter must secure a group of 
efficiency scores under such conditions that their distribution 
can not be explained in terms of any of the first four groups of 
factors. 

1. The objective conditions under which the scores are 
obtained must be exactly alike for all members of the group. 
In measuring typing ability, the efficiency scores must be 
obtained for the same kind and length of material, and for 
the same environmental conditions. If all of the objective 
conditions of the test are alike for all members of the group, 
none of the differences in the obtained distribution of scores 
can be due to any differences in those conditions. 

2. The members of the group must also be equated in 
respect to the attitudinal group of factors. This equation 
can be attained only by means of the experimental instruc- 
tions. A. To avoid any ambiguity of interpretation, the 
objective of the test should be specifically stated. Often the 
objective of the test is taken for granted, and any such state- 
ment is omitted from the instructions. This procedure may 
be justified in the case of all tasks that have a single and well 
known objective as in the case of golf. Many tasks like type- 
writing have several objectives, and in these cases the nature 
of the objective should be specifically stated. Otherwise 
some individuals may strive for speed, others for accuracy, 
and others for some combination of the two. B. Naturally
the experimenter must score the performances in respect to 
the stated objective. If all of the individuals were instructed 
to attain speed, it is obvious that a set of accuracy scores 
could not be used as a valid means of measuring their relative 
ability either to type speedily or to type accurately. C. 
Finally all members of the group must be instructed to do 
their best, and the experimenter must attempt to secure their
cooperation in so doing.

The most serious difficulty encountered in the measurement
of ability is the equation of the group in respect to degree
of motivation. Practically all flagrant errors in these mea- 
surements can be traced back to this source. The degree to 
which the subject is motivated by the experimenter’s request 
to do his best is to some extent a function of their personal 
relation, and the nature of this will vary considerably from 
subject to subject. The experimenter must thus seek to 
establish cordial relations with each subject tested. The 
degree to which an individual is motivated to do his best is 
also a function of his attitude toward the test, and this attitude 
will vary with the subject. The incidence of this factor may 
be illustrated in terms of a test of intelligence. Intelligence 
test scores are often used as administrative devices in our 
colleges. It is known that some students refrain from doing 
their best in these tests in order to escape from the subsequent 
administrative pressure. Some individuals are inclined to 
regard many of the test items as silly and trivial in character, 
and hence are indifferent and careless in their responses to 
these items. Some individuals are inclined to be skeptical of 
the validity and value of all test procedures, and hence do not 
react to them in a serious manner. Practically all subjects 
know that these test scores are to be used for comparative 
purposes, and individuals are motivated differently by this 
factor of competition. The desire to excel motivates many 
individuals to their best efforts, while other subjects are but 
little concerned as to their relative standing in this type of 
competitive situation, and hence are not susceptible to this 
form of motivation. 

3. The differential influence of the group of intra-organic 
factors is usually eliminated by excluding those individuals 
from the test who possess these factors or by postponing the 
test until the individuals have recovered. A few individuals 
are emotionally disturbed when first confronted with a test 
situation. Such disturbances can usually be allayed by a 
tactful experimenter. There is also the occasional individual 
who is skeptical of his ability, and who is fearful that the test 
may reveal his deficiencies. Usually the resultant emotional 
disturbance detrimentally affects his scores. Such cases are 
extremely difficult to handle. 

4. The usual device for detecting the presence and 
eliminating the influence of the chance group of factors is the 
method of repeated administration of the test. The test must 
be repeated under such conditions that the scores of the 
individuals will not be affected by their success or failure in the 
previous administration. 

The logic of the method is obvious. If the group distribu- 
tion of scores is wholly a function of the constitutional and 
hence enduring differences of the individuals, the correlation 
between the scores for two administrations will approximate 
+ 1.00. The presence of the chance factors, however, will 
operate to decrease the size of the correlation. The size 
of the obtained correlation is thus a function of the relative 
influence of the two groups of factors on the distribution of 
the scores. Therefore the correlation of the scores for two 
administrations of the test is computed. If the correlation 
approximates + 1.00, the influence of the chance factors is 
negligible in amount, and the obtained distribution of scores 
may be used as indices of the relative ability of the members 
of the group. 
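
The comparison of two administrations can be sketched in a few lines of Python. This is only an illustration: the scores are invented, and the helper name `pearson_r` is not from the article. The product-moment correlation is assumed here as the index of agreement, since the text does not say which coefficient is intended.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length lists of scores."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two score lists of equal length (at least 2)")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when a list has no spread")
    return cov / sqrt(var_x * var_y)

# Hypothetical typing-speed scores (words per minute) for five subjects,
# listed in the same subject order for both administrations.
first_administration = [42, 55, 61, 48, 70]
second_administration = [44, 53, 63, 47, 72]

r = pearson_r(first_administration, second_administration)
print(f"correlation between administrations: {r:+.2f}")
```

A value close to +1.00 would suggest, on the argument above, that the chance factors are negligible for this group.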

A low correlation, on the other hand, indicates that the 
chance factors are influentially present. In this case, the 
influence of these factors is balanced or equated by securing 
for each individual an average score for a number of repeated 
administrations, and it is the distribution of these average 
scores that is used to measure the relative ability of the mem- 
bers of the group, for it is these average scores that most 
closely approximate that degree of efficiency that can be 
consistently repeated. The number of administrations neces- 
sary to secure an effective equation of the chance factors is a 
function of the size of the correlation, or of the degree to which 
these factors influence the records. 
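
A minimal sketch of the averaging step follows. The article gives no rule for the number of administrations required; the Spearman-Brown formula used below is a standard psychometric device swapped in for illustration, and it assumes that the repeated administrations are equivalent to one another. The function names and the scores are hypothetical.

```python
import math

def average_scores(administrations):
    """Average each subject's scores across repeated administrations.

    `administrations` holds one score list per administration, with the
    subjects in the same order in every list.
    """
    n_subjects = len(administrations[0])
    if any(len(scores) != n_subjects for scores in administrations):
        raise ValueError("every administration must cover the same subjects")
    return [sum(subject) / len(administrations) for subject in zip(*administrations)]

def administrations_needed(r_single, r_target):
    """Spearman-Brown estimate of how many administrations to average.

    Assumption (not from the article): averaging k equivalent administrations
    raises the correlation from r to k*r / (1 + (k - 1)*r).
    """
    if not (0 < r_single < 1 and 0 < r_target < 1):
        raise ValueError("both correlations must lie strictly between 0 and 1")
    k = r_target * (1 - r_single) / (r_single * (1 - r_target))
    # Small tolerance so that floating-point noise does not add a spurious trial.
    return max(1, math.ceil(k - 1e-9))

# Three hypothetical administrations for two subjects.
print(average_scores([[40, 50], [44, 54], [42, 52]]))
# The lower the single-administration correlation, the more repetitions are needed.
print(administrations_needed(0.5, 0.9), administrations_needed(0.6, 0.9))
```

The second function simply makes concrete the statement that the number of administrations depends on the size of the obtained correlation.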

The amount of each individual’s ability must next be 
numerically defined in terms of his position within the group. 
There are two approved methods for so doing. 1. The 
efficiency scores for the group are arranged in order of magnitude. 
They are then divided into a number of groups that 
are equal in respect to number of individual scores. The 
amount of each individual’s ability is numerically defined in 
terms of the ordinal value of the group to which he belongs. 
It is customary to employ a hundred sub-divisions, and this 
gives a scale of ordinal values ranging from 1 to 100. A 
definition of an individual in terms of this scale denotes his 
percentile rank in the group. 2. The range of scores is 
divided into a number of equal steps, and the individual is 
numerically defined in terms of the ordinal values of these 
steps. The size of the units and hence the number of steps is 
a matter of arbitrary choice. The unit that is generally 
employed is the sigma of the distribution inasmuch as this 
procedure gives comparable units for different abilities. 
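
The two methods may be illustrated with a short Python sketch. The scores are invented, the function names are hypothetical, and two simplifications are assumed: tied scores fall in the same group, and the sigma value is reported as a continuous quantity rather than as the ordinal number of a step.

```python
from statistics import mean, pstdev

def percentile_group(score, group_scores, n_groups=100):
    """Ordinal value (1..n_groups) of the equal-sized group containing `score`."""
    below = sum(1 for s in group_scores if s < score)
    group = (below * n_groups) // len(group_scores) + 1
    return min(n_groups, group)

def sigma_score(score, group_scores):
    """Distance of `score` from the group mean, in sigma units."""
    sd = pstdev(group_scores)  # sigma of the whole distribution
    if sd == 0:
        raise ValueError("sigma units are undefined when all scores are equal")
    return (score - mean(group_scores)) / sd

# Hypothetical efficiency scores for a group of five individuals.
scores = [40, 50, 60, 70, 80]
for s in scores:
    print(s, percentile_group(s, scores), round(sigma_score(s, scores), 2))
```

Because the sigma is computed from each distribution separately, the second method yields units that can be compared across different abilities, which is the reason given above for preferring it.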

The concept of ability.—In measuring the amount of ability, 
we again encounter the threefold reference of the term. 1. 
An individual’s ability refers to his maximal efficiency of 
performance which was subjected to measurement. 2. It 
is a predictive concept in that it refers to the possibility of 
repeating this performance on some future occasion. Without 
this predictive reference, we would merely speak of what 
individuals have done without any reference to what they 
may be expected to do in the future. 3. The term is also 
used to define the reactive nature of the individual in virtue 
of his possession of those constitutional conditions upon which 
the measured degree of efficiency is contingent and which 
render possible its future repetition. In other words, our 
individual is a person of such a nature that he not only has 
exhibited this degree of efficiency in times past, but he is one 
who can be expected to repeat this performance on future 
occasions. 

We have noted that two questions arise in any discussion 
of this topic. 1. What abilities does an individual have? 
2. What amount of each of these abilities does he possess? 
These two questions need to be carefully distinguished in any 
discussion of many of the implications of the concept. 

The presence of a given ability in an individual can be 
decided on the basis of an observation of that individual’s 
behavior. To measure the amount of that individual’s 
ability, his behavior must be compared with that of some 
standard group. Ability, when subjected to measurement, 
becomes a relative term. 

When we refer to the presence of an ability, the nature of 
that ability is named and specified in terms of the correspond- 
ing act. The ability that is subjected to measurement, how- 
ever, must be further specified in terms of the end which the 
individual is attempting to attain and in terms of which his 
efficiency is measured. In the first case we speak of typing 
ability, but it is the ability to type speedily that is measured. 
As will be developed later, it is also necessary at times to 
specify the nature of the ability that is measured in terms of 
some of the conditions under which the act is performed. 
We may find that the ability which we are measuring must be 
specified as the ability to type accurately in distractive sur- 
roundings, or the ability to type rapidly from dictation. The 
relative efficiency of a group of individuals may so vary with 
some of these conditions that we find that we are dealing with 
two relatively independent abilities. For example, it is well 
known that relative speed of running varies with distance, 
and it is for this reason that we are accustomed to differentiate 
between sprinters and distance runners. 

An individual’s ability is contingent upon some of the 
constitutional and enduring conditions of the organism. 
How long do these conditions endure? It is obvious that the 
evidence for the continued existence of these organic condi- 
tions must be derived from the observation of the occurrence 
of the corresponding act. 1. When we are dealing with the 
presence of an ability, we may say that the ability when once 
present usually persists throughout life. Abilities can be 
lost as a result of accidents and disease. Blindness, deafness, 
paralysis, and the various aphasias are illustrations. The 
ability to speak a language acquired in early life may be lost 
with certain changes in cultural environment. But abilities 
tend to persist throughout normal conditions of life. 2. A 
different answer must be given to the question of the persistence 
of a given amount of ability, or a given level of efficiency. 
Levels of efficiency are known to vary with a variety 
of circumstances, and no categorical answer to the question 
can be given. A few illustrations will suffice. Maximal 
levels of efficiency in some skills decrease with age. An 
individual who has attained the ability to run 100 yards in 
9.7 seconds cannot long retain this level of efficiency in spite 
of training and every effort to keep in condition. A major 
league level of efficiency in baseball is also of short duration. Lower 
levels of efficiency in these and other skills can be retained for 
longer periods of time. Levels of efficiency that are primarily 
functions of special training will vary with further oppor- 
tunities for practice. Low levels of efficiency may remain 
relatively constant or improve with age, while high levels of 
efficiency will tend to disintegrate without a considerable 
amount of intentional practice. The concept of ability, 
however, is of popular origin, and its predictive value is its 
main function. Constitutional conditions do change, but 
they change slowly. Any attained level of efficiency usually 
endures for a sufficient duration of time to serve all practical 
purposes of prediction. 

Unity and complexity.—Is personality a unitary or complex 
object? The same question may arise in reference to ability. 
Ability and personality, like all conceptual objects, are both 
unitary and complex in character. Unity and complexity 
are relative terms. A unity implies parts that are united, and 
a thing that is complex must also exhibit unity or we would 
not term it a thing. 

An ability, when measured, must be specified in terms of 
the end which the individual strives to attain, and both the 
act and the derived concept of ability are unitary when 
regarded from the standpoint of that end. An ability is 
necessarily a unit in so far as we regard it as an object of 
thought and give it a name. 

While the act to which the concept refers is a unit regarded 
from the standpoint of the end attained, yet it also exhibits 
a considerable degree of analytical complexity. The act of 
typing involves the coordination of practically the entire 
sensory and motor resources of the organism. The act can 
thus be conceptually analyzed on the basis of its sensori-motor 
components. The act may also be analytically subdivided 
into various temporal components that are of functional 
importance, such as the purposive set of the individual, 
the perception of certain units of the material, its memorial 
retention, and the reaction of the fingers to this material. 
Typing ability may thus be regarded as the integrated com- 
posite of such things as manual dexterity, the ability to read, 
the ability to coordinate the hands and eyes, the ability to 
keep our minds on the task, and the ability to maintain a 
favorable bodily posture. 

The term is used to define the constitutional nature of the 
individual as revealed in his efficiency of performance. The 
concept thus denotes all of those constitutional characteristics 
of the organism that condition the efficiency of the correspond- 
ing act. Each individual possesses a multitude of abilities, 
and each is denoted by a distinctive name. However, we 
must avoid the naive tendency to assume that each of the 
corresponding constitutional conditions is therefore simple and 
unanalyzable in character, and occupies a distinctive location 
within the organism. Since most acts are quite complex and 
involve the entire organism, we must assume that their organic 
conditions are also complex and variously located throughout 
the organism. The concept is thus complex in respect to its 
organic reference. The organic conditions underlying each 
ability are unitary only in respect to their functional relation 
to the act in question. 

According to the above conception, practically all abilities 
will be intimately related to each other in respect to their 
organic conditions. It does not follow, however, that each 
pair of abilities will necessarily exhibit a positive or negative 
correlation. A zero correlation between two abilities does not 
necessarily mean that they are unrelated on the organic side. 
Two abilities may be intimately related with each other on 
the organic side, and yet the nature and amount of this 
relation may so vary from individual to individual that a zero 
correlation is obtained for a group of individuals. A high 
positive correlation merely means that the nature of the 
organic relation is relatively constant from individual to 
individual throughout the group. 
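
The point can be made concrete with an artificial example in Python. The figures are invented solely for illustration: in one half of a hypothetical group the second ability rises with the first, and in the other half it falls, so that the two abilities are perfectly related within each half and yet uncorrelated for the group as a whole.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Product-moment correlation between two equal-length lists of scores."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Hypothetical scores: the relation between the two abilities is strict in
# both subgroups but opposite in direction.
ability_a = [1, 2, 3, 4, 5, 1, 2, 3, 4, 5]
ability_b = [1, 2, 3, 4, 5, 5, 4, 3, 2, 1]

r_first_half = pearson_r(ability_a[:5], ability_b[:5])
r_second_half = pearson_r(ability_a[5:], ability_b[5:])
r_whole_group = pearson_r(ability_a, ability_b)
print(r_first_half, r_second_half, r_whole_group)
```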

Degree of universality.—Universality and generality are 
two terms that are often confused. Universality is here 
used to refer to the extent to which an ability is distributed 
throughout the population. Abilities are thus universal in 
proportion to the number of individuals who possess them. 

So far as the presence of an ability is concerned, the vari- 
ous abilities differ materially in respect to their degree of 
universality. Some abilities are quite restricted in their 
distribution, while others are so widely distributed that they 
approximate the maximum degree of universality of 100 per 
cent. A few illustrations will suffice. It is generally assumed 
that all psychological individuals—both human and animal— 
possess the ability to learn. In fact, the existence of this 
ability has often been proposed as one of the distinguishing 
characteristics of a psychological organism. Locomotion is 
another highly universal ability, while the ability to walk as 
distinct from swimming, crawling, and flying is more limited 
in distribution. The ability to talk is practically universal 
so far as human individuals are concerned, while the ability 
to speak French or English is more limited in distribution. 
The abilities to read cuneiform inscriptions and to solve 
differential equations are still more limited in distribution, and 
thus possess a low degree of universality. 

The assertion is sometimes encountered that the ability to 
speak French or English is universal because all people can 
acquire this skill. According to this conception practically 
all human abilities are universal. The conception fails to 
differentiate between attainments and potentialities, between 
the ability to perform a given act and the ability to learn that 
act. The ability to speak English and the ability to learn 
to speak English are two different things. 

When we deal with the amount of ability, the only sig- 
nificant question is the relative distribution of various amounts 
of the same ability. Factual data are largely lacking. For 
statistical purposes, however, it is generally assumed that the 
various amounts of each ability are distributed in accordance 
with the normal probability curve. According to this conception 
it is the average level of efficiency that is most widely 
distributed throughout the population, while the extremely 
high and the extremely low levels of efficiency are possessed 
by relatively few individuals. 
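
Under the assumption of a normal distribution, which the text notes is adopted for statistical convenience rather than established by data, the proportions involved can be computed directly. The following Python sketch uses the standard relation between the normal curve and the error function; the function name is hypothetical.

```python
from math import erf, sqrt

def fraction_within(k_sigma):
    """Fraction of a normal distribution lying within k_sigma of the mean."""
    return erf(k_sigma / sqrt(2))

for k in (1, 2, 3):
    inside = fraction_within(k)
    print(f"within {k} sigma: {inside:.4f}   beyond: {1 - inside:.4f}")
```

Roughly two thirds of the individuals fall within one sigma of the average, while only a small fraction lie beyond two sigma in either direction.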

General ability.—The concept of general ability is often 
encountered in the literature. Much of the discussion of this 
topic is confusing and ambiguous in import. This confusion 
is largely due to the fact that the term general has three 
distinct popular meanings. 

1. One popular meaning is that discussed in the previous 
topic of universality. Generality refers to the distribution of 
an ability throughout the population. An ability is general 
in proportion to the number of people possessing it. The 
ability to speak English is thus more general than the ability 
to speak Sanskrit. It is to be noted that we are here dealing 
with the problem of the presence or absence of an ability, 
that abilities differ in degree of generality, and that every 
ability of any consequence possesses some degree of generality. 
However, we have preferred to follow the customary procedure 
of most psychologists in using the term universality 
to denote this attribute of ability. 

2. Abilities are also general in respect to some logical 
scheme of classification. Society has not only identified and 
named a great variety of abilities, but it has also arranged 
them in various classes and subclasses. We are not concerned 
with the validity of any particular schema of classification. 
We merely note that such popular classifications do exist, 
and for the purpose of this paper we shall cite two illustrations. 
Locomotor ability is a class term that includes the abilities 
to walk, to swim, to fly, and to crawl. Athletic ability is a 
class term that denotes such items as golf, tennis, boxing, 
swimming, fencing, football, baseball, and the various track 
and field events. Track ability is a subclass that includes 
the various distance and hurdle events. Ability in field 
athletics is another subclass that includes pole vaulting, 
javelin throwing, and the various jump and weight events. 
Ability in football and baseball may also be regarded as 





368 H. A. CARR AND F. A. KINGSBURY 


subclasses because there are several recognized forms of each 
of these sports. 

A. We shall first deal with the presence of this type of 
general ability. Some class terms denote general abilities 
while others do not. A class term denotes a general ability 
when its various particular abilities are the common posses- 
sion of some individual or group of individuals. On the 
other hand, a class term is not a general ability when its 
various particulars belong to different individuals or groups 
of individuals. The distinction may be readily illustrated in 
terms of locomotor and athletic abilities. 

Organisms differ in respect to their mode of locomotion. 
Some organisms traverse space by crawling, and others by 
swimming. Other organisms can walk but not fly, while some 
can both fly and walk. No one organism, however, exhibits 
all four modes of locomotion. Hence locomotor ability is 
purely a class concept, for no organism possesses this ability 
in the generic meaning of the term. 

It is generally assumed that every normal human indi- 
vidual can participate with some degree of efficiency in all 
athletic sports, and hence we assume that they each possess 
some degree of golfing, tennis, boxing, baseball, and track 
ability irrespective of how good or poor those abilities may be. 
Since all individuals possess in some degree each of these 
particular abilities, they therefore possess an athletic ability. 
Athletic ability is thus not only a class term or concept, but 
it also denotes a type of general ability which individuals 
actually possess. 

B. There is also the question of the existence of a given 
amount of ability throughout a class of activities. We ob- 
serve an outstanding performer in some sport such as golf. 
This individual is also known to be an excellent performer in 
a number of other sports. Under these conditions, we assume 
that this level of excellence probably obtains for all athletic 
skills, and we judge that our individual is a natural all around 
athlete—an individual who possesses a high degree of athletic 
ability. Individuals may likewise be characterized as poor or 
mediocre athletes. 

An individual’s level of efficiency is never exactly the same 
for all sports. His relative efficiency within a group will vary 
to some extent from activity to activity, and the amount of 
this variation will differ for individuals. Our popular judg- 
ments of the existence of a certain amount of general ability 
are thus concerned with levels of efficiency that exhibit some 
unspecified degree of consistency, or with levels of efficiency 
whose variations do not exceed a certain unspecified amount. 
This popular concept of general ability thus involves a quanti- 
tative criterion. If this concept is to be used for scientific 
purposes, it is evident that it must be arbitrarily defined in 
statistical terms. Moreover, the number of individuals who 
possess a general ability of this sort will vary with the quanti- 
tative criterion adopted. 
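
One way in which such a statistical definition might be framed is sketched below in Python. The criterion chosen here, namely that an individual's percentile ranks in the several sports shall not differ by more than a stated amount, is an assumption made for illustration and is not proposed in the article; the names and the ranks are likewise invented.

```python
def has_general_ability(ranks, max_spread):
    """True when the ranks across activities differ by no more than max_spread."""
    return max(ranks) - min(ranks) <= max_spread

def count_general(group, max_spread):
    """Number of individuals in the group who satisfy the criterion."""
    return sum(1 for ranks in group.values() if has_general_ability(ranks, max_spread))

# Hypothetical percentile ranks of four individuals in four sports.
group = {
    "A": [90, 85, 88, 92],
    "B": [70, 40, 95, 20],
    "C": [50, 60, 45, 55],
    "D": [15, 30, 10, 35],
}

for max_spread in (10, 20, 30):
    print(max_spread, count_general(group, max_spread))
```

The count changes with each value of `max_spread`, which is the dependence on the adopted criterion noted above.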

3. Every ability possesses some degree of generality in 
respect to the variable circumstances in which the act may 
occur. A group of individuals is tested for speed of typing 
under certain standard conditions. These measured amounts 
of ability are general in so far as these individuals maintain 
their relative standing for all of the variable conditions that 
are encountered in professional work. The relative golfing 
ability of a group of individuals is general in so far as they 
maintain their relative ranking irrespective of the nature of 
the course, weather and climatic conditions, and the presence 
and absence of a gallery. A given amount of ability is thus 
general in respect to all those variable circumstances which 
do not materially affect it. 

Our knowledge of the degree of generality of any ability 
must be based upon empirical data. The typing efficiency of 
a group of individuals is measured for quiet and for distrac- 
tive conditions. Let us assume that the correlation between 
the two sets of scores approximates zero. Under these condi- 
tions, typing speedily under distractive conditions and typing 
speedily in quiet surroundings are two independent abilities 
that are to be subsumed under the class concept of typing 
ability. Each of these abilities, however, is still general in 
respect to all other variable conditions such as amount and 
kind of material. The empirical data may thus necessitate 
the subdivision of any hypothetical ability into a number of 
abilities whose distinctive nature must be further specified 
in terms of their limiting conditions. No matter how specific 
may be the abilities that are thus isolated, each will neces- 
sarily possess some degree of generality. Every ability, as 
previously stated, denotes a level of efficiency that can be 
repeated in the future, and hence by definition it must at 
least possess some degree of generality in respect to time. 

The necessity of subdividing certain hypothetical abilities 
on the basis of empirical data may be illustrated with speed of 
running. In track athletics, it is a matter of common knowl- 
edge that relative speed of running is to a considerable extent 
a function of the variable factor of distance. First class 
sprinters usually make relatively poor records in the distance 
events, and outstanding distance runners are usually poor 
performers in the sprints. No runner is an outstanding 
performer in all events. As a consequence, speed of running 
has been subdivided into three relatively independent abilities, 
and racing contestants are classified as sprinters, middle 
distance runners, or distance runners. 

We are thus confronted with three kinds of generality: 
1, generality in respect to the population; 2, generality in 
respect to a class of activities; and 3, generality in respect to 
the variable circumstances in which the act may occur. 
We have previously noted that ability is a predictive concept. 
We may now call attention to the fact that this concept is 
predictive just in so far as it is general. Abilities may differ 
in degree of generality, and the greater the degree of general- 
ity, the greater is the range of possible prediction. The 
direction or character of the prediction is a function of the 
kind of generality that obtains. We can predict within a 
classificatory system only in so far as the ability is general in 
this respect, and we can predict from one set of circumstances 
to another only in so far as the ability possesses this kind 
of generality. For practical purposes of prediction, it is 
evident that it is the third kind of generality that is most 
important. 

Learning ability.—Learning ability is distinctive in that 
it is a genetic or developmental concept as contrasted with all 
other abilities which may be arbitrarily called performance 
abilities. Learning ability differs from the performance 
abilities in respect to the goal or objective which the indi- 
viduals are striving to attain and in terms of which their 
efficiency is measured. The goal of a performance ability is 
the immediate output of the act—amount of material per unit 
of time in the case of speed of typing. In learning, the goal 
is improvement of output from performance to performance. 
The individual strives to reach a certain level of efficiency 
with as few practice trials as possible, or to secure the maxi- 
mum improvement possible in a given amount of practice. 
The goal of learning is future efficiency of performance, and 
learning is thus a developmental concept. 

Each individual can learn a great variety of things, and 
each act of learning can occur under various temporal and 
spatial circumstances. Obviously the term learning ability 
possesses some general significance. Is learning ability 
general in the sense that individuals differ consistently in 
respect to rate of learning irrespective of what is learned and 
the circumstances under which the process occurs? In 
the literature there is much discussion of the existence of this 
type of general ability, but there are no adequate and decisive 
data upon which to base a confident decision. So far as is 
known to the contrary, this type of learning ability, like 
many other abilities of which we talk so glibly, may denote a 
purely abstract and conceptual entity that has no more con- 
crete existence than a dog in general. 

Before discussing the next topic, it is necessary to note 
briefly the distinction between the intentional and the inci- 
dental types of learning. Human subjects, in any laboratory 
experiment, must necessarily be given some instruction as to 
the task which they are expected to do. In a learning experiment, 
the subjects are instructed to learn as quickly as possible. 
Learning under these conditions is termed intentional 
learning. The intent to learn is one of the essential condi- 
tions involved in the measurement of learning ability, for the 
relative learning ability of a group is measured in terms of 
rate of improvement when the individuals are attempting to 
learn as quickly as possible. 

The reaction experiment may be used to illustrate the 
incidental type of learning. The subjects are instructed to 
react as quickly as possible. In such an experiment, a large 
number of records distributed over several weeks may be 
secured for each subject. When these speed records are 
analyzed, considerable improvement from day to day may be 
detected. The improvement that occurred was an incidental 
by-product of the task. All improvement due to practice, 
when the intent to learn is absent, is termed incidental learn- 
ing. Intentional and incidental learning thus differ in respect 
to the presence and absence of the intent to learn. 

Learning ability and the I. Q.—The I. Q. is a concept 
whose meaning must first be defined in terms of the operations 
by which it is measured. When viewed from this standpoint, 
the I. Q. is a genetic or developmental concept. We shall use 
the Stanford-Binet test for purpose of illustration. 

The Stanford-Binet test consists of several groups of test 
items arranged in an age scale. The group of items for each 
age represents the maximal ability for the average child of 
that age. All of the items are tasks that are learned. All 
items are products of the incidental type of learning. The 
various groups of items thus represent successive levels of 
learning achievement, and hence the test may be regarded as a 
learning or developmental scale in which the successive steps 
are all equal in respect to the time required for the average 
child to progress from one level to another, viz., one year. 

A number of children are subjected to the test, and they are 
each scored in terms of a mental age. The group of mental 
age scores thus represents individual differences in amount of 
learning achievement. The various test items were so se- 
lected that the opportunities for learning them are presumably 
equal for all of the children with the single exception of time, 
i.e., the age of the child. To eliminate this time or age factor, 
the mental age score of each child is divided by its chrono- 
logical age. The resultant quotients are the I. Q. scores, and 
these scores are thus usually regarded as measures of those 
differences in rate of incidental learning that are functions of 
innate differences between the individuals. 
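
The computation just described may be written out as a small Python sketch. The multiplication by 100 follows from the statement elsewhere in the text that an I. Q. of 100 is the norm; the expression of both ages in months and the function name are assumptions made for convenience.

```python
def intelligence_quotient(mental_age_months, chronological_age_months):
    """Mental age divided by chronological age, scaled so that 100 is the norm."""
    if chronological_age_months <= 0:
        raise ValueError("chronological age must be positive")
    return 100 * mental_age_months / chronological_age_months

# A hypothetical child of 6 years 8 months (80 months) with a mental age of
# 8 years (96 months), and a second child whose two ages coincide.
print(intelligence_quotient(96, 80))
print(intelligence_quotient(72, 72))
```

Dividing by chronological age is what removes the time factor: two children of different ages who have each progressed at the average rate receive the same quotient.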

Learning ability and the I. Q. are closely related concepts. 
Both are genetic and predictive concepts. Differences in 
learning ability are measured in terms of those differences 
in rate of intentional learning that are known to be functions 
of the constitutional differences of the individuals, and those 
differences are innate and acquired in respect to origin. The 
I. Q. scores are usually regarded as measures of those differ- 
ences in rate of incidental learning that are functions of 
innate differences of the individuals. 

In measuring the I. Q. and likewise in measuring learning 
ability, we are studying relative rate of learning under a 
particular set of circumstances. The I. Q. is a measure of the 
relative rate of learning a selected set of test items. Learning 
ability is concerned with relative rate of learning some selected 
material such as a list of words or a stylus maze of a given 
pattern. Presumably each set of data is valid for the particu- 
lar set of circumstances involved. Such data, however, are 
of value only in so far as they have some general significance— 
only in so far as they can be regarded as typical of what oc- 
curs in all of the varied circumstances of life. Which of the 
two sets of measures possesses the greater degree of general 
significance? Several comparisons are possible. 

1. The I. Q. scores are measures of relative rate of learning 
over a period of years. The time required for learning a 
stylus maze or a list of words is usually less than an hour. 
Obviously the I. Q. values are more typical and representa- 
tive in this respect. 

2. The I. Q. scores measure rate of incidental learning. 
Measures of learning ability involve intentional learning. 
The occurrence of the intentional type of learning is largely 
limited to the laboratory and the schools. The incidental 
type of learning goes on all the time. The larger part of our 
development is a product of incidental learning. The I. Q. 
scores are thus more typical in respect to type of learning. 

3. In measuring learning ability, a relatively restricted 
and homogeneous material is usually employed. Neither a 
list of words nor a stylus maze can be regarded as a typical 
sample of the great variety of things that we learn in every 
day life. In constructing the test, the items were selected 
so as to secure the maximum of variety and heterogeneity of 
material possible, and to choose for each group those items 
that are typical of children’s activities at that age level. The 
I. Q. scores are the more general in respect to the materials 
employed. 

4. The I. Q. scores are supposed to measure those differ- 
ences in rate of learning that are mainly functions of innate 
differences of the individuals. Presumably these differences 
will underlie and hence differentially affect the learning of all 
materials. Innate conditions that affect rate of learning are 
thus likely to exert quite a widespread and pervasive effect 
upon rate of development. Studies of transfer, on the other 
hand, show that the residual effects of learning one task often 
exert a much more restricted effect upon subsequent develop- 
ment than the old doctrine of formal discipline assumed. 

5. Under ordinary conditions of life, a child’s I. Q. re- 
mains approximately constant throughout life. This fact 
has been empirically established. While many fluctuations 
are encountered, yet a high degree of constancy is the rule. 
On the basis of the I. Q. scores of a group of children obtained 
in early life, it is thus possible to predict their relative status 
in later years with a considerable degree of accuracy. It may 
be noted that this prediction is independent of the age factor. 
This independence is due to the fact that we are dealing with 
an age scale. A child of six has an I. Q. of 105. This score 
primarily denotes that this child stands 5 points above the 
norm for his age group. This age scale was so constructed 
that an I. Q. of 100 is the norm for all age groups. Consequently 
our hypothetical child has an I. Q. which is 5 points 
above the norm of all age groups. 

This prediction irrespective of age cannot be made on the 
basis of measures of learning ability. Experiments have 
abundantly established the fact that memory ability increases 
quite rapidly up to maturity and then remains approximately 
constant for some time thereafter. Two groups of children 
of 8 and 12 years each memorize a list of words. The average 
score for the two age groups will differ considerably, but will be 
approximately the same 14 years later. Predictions of future 
status irrespective of age differences are possible only when 
these measures of memorizing ability are transformed into an 
age scale. 
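
The age-scale transformation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it assumes the classical ratio definition of the I. Q. (mental age divided by chronological age, multiplied by 100), and the age norms, function names, and scores are all invented for the example rather than taken from any study.

```python
# Hypothetical age norms for a memory test: average number of words
# recalled at each age. These numbers are invented for illustration.
AGE_NORMS = {6: 4.0, 8: 5.0, 10: 6.0, 12: 7.0, 14: 8.0}


def ratio_iq(mental_age, chronological_age):
    """Classical ratio I.Q. (assumed definition): 100 * MA / CA."""
    return 100.0 * mental_age / chronological_age


def mental_age_from_score(score):
    """Find, by linear interpolation, the age whose norm equals the raw score."""
    ages = sorted(AGE_NORMS)
    for lo, hi in zip(ages, ages[1:]):
        s_lo, s_hi = AGE_NORMS[lo], AGE_NORMS[hi]
        if s_lo <= score <= s_hi:
            return lo + (hi - lo) * (score - s_lo) / (s_hi - s_lo)
    raise ValueError("score outside the range covered by the norms")


def age_scaled_score(score, chronological_age):
    """Express a raw memory score on the age scale."""
    return ratio_iq(mental_age_from_score(score), chronological_age)


if __name__ == "__main__":
    # An 8-year-old and a 12-year-old with different raw scores.
    print(age_scaled_score(5.0, 8))
    print(age_scaled_score(7.0, 12))
```

In this sketch the raw scores of 5 and 7 words differ considerably, yet each child stands exactly at the norm for his own age once the scores are expressed on the age scale, which is what permits prediction irrespective of age.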

6. Experiments have shown that the I. Q. scores exhibit 
some degree of positive correlation with a wide variety of 
both learning and performance types of ability. This fact 
again indicates that these scores are highly general in their 
significance. 

This type of measurement was originated and first used for 
practical purposes, and as a consequence the concept was long 
held in rather poor repute. Because of the recognition of its 
general and developmental significance, the concept has at- 
tained a considerable degree of respect, and there are some 
indications that it may be quite generally adopted as one of 
the major categories of psychology. For these reasons the 
I. Q. is often defined as a measure of learning ability, as a 
measure of intelligence, and as a measure of rate of mental 
growth. 

Aptitude.—Aptitude is another popular term that is used 
to define the reactive nature of an individual. Aptitude is 
similar to learning ability in that it is a genetic and predictive 
concept. 

These aptitude judgments arise in the following types of 
situation. An individual is observed to exhibit an unusually 
high level of initial performance in some line of activity such 
as music or golf. This initial level of performance cannot be 
due to special training or practice. Again an individual may 
exhibit an unusual amount of improvement on the basis of a 
small amount of initial practice. Under these conditions the 
individual is judged to possess some special aptitude for these 
skills, and these judgments imply the belief that this indi- 
vidual, with the requisite training and practice, will become an 
outstanding performer in this line of activity. The term is 
thus used to predict future level of performance on the basis 
of initial performance. 

Ability as a psychological concept.—Ability is an important 
concept for educational and industrial psychology. It is 
particularly important from the standpoint of placement and 
personnel. Is its value limited to applied psychology, or is it 
also significant from the standpoint of systematic psychology? 

A few questions may be proposed. What abilities shall be 
studied, what knowledge of these shall be sought, and how 
shall this knowledge be integrated into a systematic account 
of mental life? A great variety of abilities have been identi- 
fied and named. Shall all abilities be subjected to study? 
Shall a few typical abilities be selected for intensive study in 
the hope of obtaining some general principles that are ap- 
plicable to all? Shall the field be simplified by factorial 
methods, and systematic investigation be limited to the so- 
called primary abilities? Shall a systematic psychology 
limit its attention to those of a distinctively psychological 
character such as the intellectual class of abilities? Is a 
systematic type of psychology concerned only with such 
genetic concepts as aptitudes, the M. A. and the I. Q., and 
learning ability? 

We shall not attempt to answer these questions, but we 
may note that many of our introductory texts do devote a 
chapter to this topic. These accounts are mainly concerned 
with tests of, or methods of measuring, ability. The ac- 
counts are largely limited to tests of intelligence with a few 
allusions to tests of stenographic, mechanical, and musical 
ability. The topic is treated primarily from the standpoint of 
applied psychology, and the material presented exhibits no 
very obvious logical relation to the rest of the book. 


[MS. received March 12, 1938] 





CHANGING VIEWS OF PSYCHOLOGY AS SCIENCE¹ 


BY ARTHUR G. BILLS 
University of Cincinnati 


Psychology, one of the youngest of the sciences, is rapidly 
becoming the most consciously self-analytical and self- 
critical. Unlike the traditionally pampered and spoiled 
youngest child of the family, it has been constantly overawed 
by the superior accomplishments of its older sister sciences, 
until it has been forced, by the necessity of compensating for 
a deep-rooted inferiority complex, to become keenly self- 
analytical of the bases for its claims to the status of a science. 
It is in the light of this extensive criticism that I wish to dis- 
cuss the topic ‘Changing Views of Psychology as Science.’ 
The title is obviously question-begging, since it implies that 
our views are undergoing a change as to what constitute the 
criteria of a scientific psychology. This is in fact my thesis, 
and it will lead me to an attempted re-evaluation of those 
basic criteria to determine which of the traditional ones have 
already been outgrown, and can be discarded, which are still 
essential, and finally how far we can safely go in broadening 
our platform in the interests of some of the newer psychological 
systems which are clamoring for scientific status with all the 
vigor and scorn for tradition that characterize youthful 
movements. I confess that some of the caustic criticisms 
from the pages of Gordon Allport’s latest book (1) have 
helped to shake my philosophical self-complacency about the 
immutable criteria of a scientific psychology. 

The disturbing fact is that any persistent and searching 
inquiry into the criteria of psychology as science leads inevita- 
bly to a discussion of the essential principles of science itself. 
Unfortunately, there exists no Mosaical Decalogue to which 
we can appeal for guidance in this realm of thought, but 

¹ Presidential address delivered before the Midwestern Psychological Association, 
Madison, Wisconsin, April 23, 1938. 

standards do get established by some sort of whispering campaign 
and we suddenly awake to find everyone agreeing that 
this or that criterion of scientific concepts is absolutely es- 
sential. For example, witness the operational criterion. 
Several years ago, Bridgman (3) published his Logic of 
Modern Physics in which this principle was given expression. 
Many of us felt, at that time, that it was rather potent. But 
science had somehow worried along in its ignorance many 
years without this principle, at least explicitly stated. Now 
everyone is whispering or shouting about the operational 
definition of concepts as an essential step in the establishment 
of their scientific status. I merely wish to indicate the fact 
that standards do get established somehow by a process of 
infiltration. And it is just such a group of cumulative stand- 
ards that I now wish to call to your attention, with the idea 
of re-examining and re-evaluating them, as essential planks 
in a minimum platform. 

Here are some of the criteria of psychology as science that 
seem to form the framework of our current credo, and that 
play an active part, whether explicitly or implicitly, in shap- 
ing the course of our current research. 


Psychology as science is empirical. 

Psychology as science is mechanistic. 

Psychology as science is quantitative. 

Psychology as science is nomothetic: i.e., concerned with 
establishing general laws. 

Psychology as science is analytic. 

Psychology as science uses operational concepts. 


It is easy to expand this list, but I believe we should find 
that any additional criteria would be subsumed under one of 
these six statements. Take for example that ancient enemy 
of domestic harmony in yesterday’s psychological household, 
the mind-body problem. What vile epithets were hurled and 
what illustrious heads broken on this battle ground, before the 
whole problem was shown to reduce to a meaningless confusion 
over ambiguities in the definition of our concepts! Even 
lineal descendants of the Titchenerian dynasty admit as much. 
Boring writes, “We now know that operational definitions 
can always transform a psychological description, expressed 
in terms of consciousness, into a description in behavioral or 
physiological terms. One does not have to like operationism 
to admit this fact. One has only to realize that it can be done 
whether one likes it or not” (2). Of course the philosophers 
are not all converted. Not long ago, I spent some precious 
hours reviewing a book, which had just been written by a 
well known contemporary philosopher (7), called The Revolt 
Against Dualism, in which at the end of 320 closely argued 
pages, he dismissed the movement in about the same tone 
an eighteenth century Englishman might have used in de- 
scribing the American Revolution, “a rather insignificant 
revolt by a handful of backwoodsmen, which was too unim- 
portant to warrant sending over a sufficient force of regulars 
to quell it.” But within the contemporary psychological 
world, a thoroughgoing dualist is as hard to find as a phrenolo- 
gist. What I wish to point out is that monism is a part of the 
credo of present day psychology, not because dualism has 
been, or could ever be, ultimately disproved on empirical 
grounds, but because monism is an inevitable corollary of 
two of our fundamental planks—operationism and mechan- 
ism. Mechanism forces one to monism because of the neces- 
sity for complete continuity in transfer of energy within the 
mechanistic system. Operationism automatically leads to a 
monism in effect because of the ready transposability of 
operationally defined concepts. The entire dilemma of 
dualism-monism would be relegated by an operationist to the 
category of meaningless problems, like the question “Are 
there any boundaries to the universe?” or “Can there be a 
universe in which two times two is five?” Parallelism would 
automatically reduce to a double-aspect formula, because 
where two sets of defining operations coincide perfectly they 
become identical operationally. 

I should like to cite one or two other historic squabbles 
in psychology which, like the dualistic battle, really reduce to 
special cases subsumed under one or other of our fundamental 
criteria. Take the objectivity-subjectivity controversy. This 
question is often treated as though it coincided with the 
problem of dualism. But that is an incidental relationship. 
Really the terms subjectivity and objectivity are two ex- 
tremes of a continuum in which degree of uniqueness and 
degree of social agreement are the defining variables, and in 
which the extreme of subjectivity would be a datum which is 
necessarily unique and capable of being reported on by only 
one person. Could such a datum be used by psychology as 
science? 

The question immediately carries us back to two of our 
major criteria, namely: that psychology as science is empirical 
and nomothetic. That is to say, it formulates general laws 
by inductive logic. One can not generalize from the unique 
instance, no matter with what rigorous control the occur- 
rence of that single instance was observed. Now it so hap- 
pens that this very problem is of immediate concern to the 
representatives of a new school of psychology, the personal- 
istic psychologists, because their major objective requires 
them to interest themselves both in the unique qualities of 
individual persons, and in unique events in the biographies of 
these unique persons. Hence to defend the status of their 
school as scientific psychology, they must make a frontal 
attack on the limiting power of those very criteria of psychol- 
ogy as science, to which I referred. But more of this later. 

These illustrations bring out the fact that most of the 
controversies of the past as well as of the present can be shown 
to revolve around the six major criteria of psychology as 
science which are also major principles of all science. Not 
all of these criteria are on an equal footing. Three of them 
are pretty widely accepted: namely, that psychology as 
science must be empirical, nomothetic, and operational. 
The other three: i.e., that psychology as science must be 
mechanistic, quantitative, and analytic, have been and are 
being subjected to constant attack. The question is, are these 
attacks legitimate? Will a re-evaluation of the criteria 
result in abandoning or modifying them? 

Let us begin by considering the proposition that psychology 
as science is analytic. Until the advent of the Gestalt 
movement, the proposition was seldom seriously questioned, 
in psychology, that to understand a complex phenomenon it 
was necessary to analyze it into its elementary components. 
Titchener analyzed consciousness into sensory elements, 
Angell into functions, Watson analyzed behavior into reflexes. 
Intelligence, though first thought of and studied as a whole 
by the mental testers, was shown by experience to possess 
only a pseudo-simplicity rather than any real simplicity, so 
that here, too, analysis is becoming the standard procedure. 
In opposition to all this, the Gestalt movement made 
explicit for psychology the doctrine that wholes possess 
properties over and above the parts or their interrelations. 
A parallel movement in philosophy, that of emergent evolu- 
tion, maintained the same principle. And latter-day physics, 
with its emphasis on field dynamics as opposed to the me- 
chanical concept of articulation of parts, paved the way for 
the doctrine of topological psychology, which attempts to 
explain the parts and their interrelationships in terms of 
field properties—that is to say, properties of the whole as-a- 
whole. Obviously, if we are to take seriously the criterion 
that psychology as science is analytic, we are compelled 
either to deny scientific status to Gestalt psychology, or else 
to recognize it as a special brand of analysis. The only other 
alternative is to deny that the analytic criterion is ultimate. 
As a matter of fact, we have excellent authority for doing 
just this. Bridgman (3), in the same book, Logic of Modern 
Physics, in which he stated the case for operationism, also 
paved the way for an attack on analysis as the exclusive ex- 
planatory procedure for science. He says, “There is a certain 
thesis implied in the general attitude of many physicists that 
an explanation of the universe is possible in which we start 
with small scale things, and explain large scale phenomena in 
terms of their small scale constituents; in other words that all 
the properties of the large are contained in the properties of 
the small, and that the large may be constructed out of the 
small. But to maintain this thesis would demand that ag- 
gregates of things never acquire properties in virtue of their 
numbers which they do not already possess as individuals.” 
He then points out several exceptions to this rule in physics, 
in systems with combination terms in them, such as electrical 
energy, which contains the square of the field, etc. In all 
such cases the explanation of the phenomenon must be sought 
in the properties of the whole, not those of the parts. 
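
Bridgman's example of electrical energy can be made concrete with the standard textbook expression for the energy density of an electric field in vacuum. The notation used here ($u$, $\varepsilon_0$, $\mathbf{E}_1$, $\mathbf{E}_2$) is a conventional choice assumed for illustration and is not Bridgman's own. If two sources produce fields $\mathbf{E}_1$ and $\mathbf{E}_2$ that superpose, then

```latex
\begin{aligned}
u &= \tfrac{1}{2}\,\varepsilon_0\,\lvert \mathbf{E}_1 + \mathbf{E}_2 \rvert^{2} \\
  &= \tfrac{1}{2}\,\varepsilon_0\,\lvert \mathbf{E}_1 \rvert^{2}
   + \tfrac{1}{2}\,\varepsilon_0\,\lvert \mathbf{E}_2 \rvert^{2}
   + \varepsilon_0\,\mathbf{E}_1 \cdot \mathbf{E}_2 .
\end{aligned}
```

The first two terms are the energies each field would have alone; the cross term $\varepsilon_0\,\mathbf{E}_1 \cdot \mathbf{E}_2$ belongs to neither part by itself and appears only when both are present together.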

It is interesting and amusing to find the defenders of 
personalistic psychology attacking Gestalt psychology on the 
very ground that it is too analytical. Their argument is that 
the only legitimate whole or totality is the individual person, 
and that to talk in terms of Gestalten as independent entities 
is to split up the personal individual into meaningless non- 
personal elements. Now this is not just a case of the pot 
calling the kettle black. It is virtually a reductio ad absur- 
dum of the whole controversy. For it brings out clearly the 
fallacy lurking behind it. The term analysis is taken too 
literally. When confronted with any complex phenomenon, 
we study it now from one aspect, now from another. In so 
doing we should recognize the partiality and artificiality of the 
features which we temporarily isolate for study. Analysis in 
this sense is universal. In fact, in order to prove that the 
whole is more than the sum of the parts, it is necessary to 
abstract the properties of the parts, and discover that there 
are still some properties left over, which belong to the whole. 
So the very proof of Gestalt psychology is by analysis. 

I believe the objection to analysis grew out of a false 
notion of the purpose of analysis, the notion that it is aimed 
at discovering real concrete or irreducible parts. And this 
was, of course, actually the original conception of the method, 
borrowed directly from chemistry and erroneously applied to 
psychological phenomena. But William James criticised the 
analysis of consciousness into sensations, on the ground that 
there are no real sensations, nor even percepts, for that matter; 
just a total indivisible stream of thought. Others have 
criticized the analysis of behavior into stimulus-response on 
similar grounds, and in genetic psychology the theory of 
agglutination of reflexes has given way to one of mass action 
for the same reason. Yet, having once recognized that such 
analytic terms as sensory element, stimulus-response, or 
reflex are convenient abstractions, we can go on using them, 
and really feel rather superior to those few, among the Gestalt 
psychologists, who haven’t yet discovered that their Gestalten 
are not real entities either, but are just convenient abstrac- 
tions which they have erroneously reified and endowed with 
indefinite potentiality. 

The attack of the personalistic school against analysis, 
however, really goes beyond the problem of analysis as such 
and carries over into an attack on a second of our principles, 
the principle that psychology as science is quantitative. For, 
say they, the main analytical method of studying personality 
has been the individual difference method, wherein a person 
is split up into a number of traits and then is assigned quanti- 
tative scores in terms of his position on a scale with reference 
to all other persons possessing these traits. It is this assign- 
ing of quantitative values to persons, just as we pin numbers 
on convicts, that is objected to; or, rather, the assumption 
that we can construct a concrete personality out of a collec- 
tion of scale values. The personalistic psychologist says 
you cannot describe a person by stating quantitatively how 
he differs from other persons, because the essential aspect of 
his personality is its uniqueness, its qualitative distinctness. 
This sounds like rank heresy. It is an attack on quantitative 
psychology in its most sacred shrine. It calls for a duel to 
the death to decide whether personalistic psychology and 
other brands that eschew quantitative method shall be hurled 
from the ranks of scientific psychology or whether, instead, 
the quantitative criterion shall no longer be counted as an 
essential plank in the platform of psychology as science. 

It is frequently said that America’s leading contribution 
to psychology has been the quantitative study of individual 
differences. This assertion is open to debate, but certainly 
quantification has been carried farther in this field than in any 
other. The measurement movement got its start only after a 
bitter struggle with the die-hards who were certain that the 
very notion of measurement or quantity as applied to mind 
or mental phenomena or personality was essentially ridiculous. 
It was only by pointing out that the term measurement 
is used in a special sense, not in the additive sense that we 
measure with a ruler, but in terms of scalar separation, that 
the champions of the movement gained a foothold for their 
cause. Now comes the personalistic school and virtually 
reaffirms the old claim that you cannot measure personality 
without destroying individuality. Assuming this to be true, 
can there be a science of personal psychology? In fact, can 
we ever make predictions about individual persons? The 
problem is unique for psychology among all the sciences, for 
other sciences do not attempt such predictions. The physi- 
cist knows that he can predict what proportion of a mass of 
atoms will disintegrate within a given period of time, but he 
never worries about the fact that he cannot predict anything 
about when any individual atom will disintegrate. The 
physiologist sets up general laws accounting for the behavior 
of organs in health and disease, but individual diagnosis or 
prognosis in medicine is not a science. It is an art. I believe 
a sensible viewpoint to take is the following: Psychology as 
science is necessarily abstract and general, dealing with the 
ways in which persons obey general laws or differ quantita- 
tively in respect to general traits. As science it should have, 
as an ultimate ideal, complete quantification of its laws, 
though that goal is a long way off, and should not be used as 
an absolute criterion in judging the worth of any current 
research. As good operationalists, we are bound to recog- 
nize that there are other valid operations than those of 
measuring, and other valid concepts derivable therefrom 
than the strictly quantitative type. 
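
The physicist's situation mentioned above can be illustrated with a small simulation. The sketch below assumes, purely for illustration, that each atom disintegrates independently with the same probability during the interval; the function names, the probability 0.3, and the sample size are hypothetical choices, not values from the text.

```python
import random


def simulate_decay(n_atoms, p_decay, seed):
    """Return one boolean per atom: True if it disintegrated in the interval.

    Assumes independent atoms sharing a common decay probability p_decay.
    """
    rng = random.Random(seed)  # seeded so that a given run can be repeated
    return [rng.random() < p_decay for _ in range(n_atoms)]


def decayed_fraction(outcomes):
    """Proportion of atoms that disintegrated."""
    return sum(outcomes) / len(outcomes)


if __name__ == "__main__":
    run_a = simulate_decay(100_000, 0.3, seed=1)
    run_b = simulate_decay(100_000, 0.3, seed=2)
    # The aggregate proportions stay close to 0.3 in both runs ...
    print(decayed_fraction(run_a), decayed_fraction(run_b))
    # ... but the fate of any particular atom differs from run to run.
    disagreements = sum(a != b for a, b in zip(run_a, run_b))
    print(disagreements / len(run_a))
```

With a large number of atoms the proportion is stable, which corresponds to the general law; the individual outcomes are what such a law leaves undetermined.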

So much for psychology as science. But in direct con- 
trast to the scientific aspect, the psychology of the concrete 
particular individual is not a science. It is an art, to be prac- 
ticed by the gifted few, and is not to be judged by the same 
criteria. It is to be judged by the standard of practical effec- 
tiveness. It can utilize all the principles that scientific 
psychology is able to contribute, but must do its own recon- 
structing, and recognize that 2 + 2 of such abstract principles 
does not add up to 4 in the concrete case, but to 5 or 7. There 
is a certain amount of falsification in all generalization 
and abstraction. In a sense, every scientific formula, concept, 
or law must be narrower than the sum total of the 
concrete instances from which it is derived. When Lewin 
(6), in the introduction to his excellent work, Dynamic 
Psychology, declares that the concepts of an adequate psychology 
must be “broad enough to apply to thought process, 
values and social relations in their mutual dependence as 
expressions of a concrete situation involving a definite person 
and condition,” he cannot mean psychology as science. He 
can only mean psychology as an art. When he uses the 
analogy of history and speaks of historical versus systematic 
concepts, he cannot be talking of history as science, but his- 
tory as narrative. History as science deals neither with 
concrete persons nor concrete events as such, but with 
general principles governing historical sequences. A better 
name for such a concrete approach would be biography, and 
biography is not science. 

Up to this point in our discussion we have examined two 
of the suggested criteria in our list, and have found that, with 
some restating, they stand as basic principles. But what of 
the third of the doubtful ones; namely, the mechanistic 
criterion? 

The difficulty with the concept of mechanism is that it 
can be and is interpreted in so many ways. The narrowest 
meaning is practically equivalent to mechanical action—that 
is to say, a form of action exactly on a par with that of a 
machine, in which the behavior of the whole is to be explained 
merely in terms of the sum of the actions of interdependent 
parts. But there are even physical processes which are not 
mechanistic in this limited sense. It is too limited a concep- 
tion even to take care of the phenomena of field dynamics. 
This conception of mechanism leads its devotees from time to 
time to construct crude mechanical models which are likely 
to be more misleading than helpful because of their over- 
simplification. 

A broader meaning of mechanism practically identifies it 
with the concept ‘physical.’ It is this meaning which leads 
to the distinction sometimes made between mechanism and 
vitalism on the assumption that life or organic nature requires 
for its explanation some principle or element over and 
above the inorganic. Such a principle is vaguely called 
directive. But, as is often pointed out, it is perfectly possible 
to assume the emergence, at the organic level, of a more 
complex stage of organization accompanied by the manifesta- 
tion of somewhat novel properties, without assuming any 
break in the natural series. An expanded physics and chem- 
istry could ultimately incorporate such phenomena. No new 
forms of energy need be assumed. No ontological dichotomy 
is involved. There is nothing to prevent such a view from 
being called mechanistic. It is merely a more comprehensive 
brand of mechanism. Let us call it ‘third level mechanism.’ 

Then, finally, there is the broadest mechanism whose 
cardinal assumption is merely that there is a complete and 
unbroken continuity in the cause and effect chain leading to a 
given response. This form of mechanism brings human be- 
havior under the broad category of natural events, capable 
of being reduced to laws or principles of uniformity. It 
affirms the applicability of inductive logic to the psycholo- 
gist’s sphere. It is contrasted with any system that as- 
sumes spontaneity, capriciousness, transcendental determi- 
nants of behavior, or teleology. It is not necessarily con- 
trasted with purposive or goal-oriented action, as some of our 
most prominent mechanists in psychology have so beauti- 
fully and adequately demonstrated in recent years. In fact, 
a person could be a mentalistic psychologist, with a preference 
for talking about ideas as determinants of action, and still be 
a complete mechanist in this last sense. 

Now it seems to me that, where any lingering objections 
exist, on the part of certain psychologists, to being classed 
as mechanists, such objections are due to the identification 
of the term with one of the narrower meanings given before. 
It might be objected that we have broadened the meaning of 
the term so much that it no longer has any definitive value. 
I believe it does have definitive value, in sharply excluding 
any theory which implies teleology, or any theory which implies 
a break in the natural causal series. I believe it erects a 
barrier against the encroachments of all the crazy fringe of 
half-baked ideas and theories that surrounds scientific 
psychology and hems it in on all sides. It creates and main- 
tains a much needed rapprochement between psychology 
and the other biological sciences. 

And so, after some broadening, and shifting in emphasis, 
and redefining, the three doubtful criteria of psychology as 
science, the mechanistic, analytic, and quantitative, are 
found to still hold an important place in our theoretical 
platform. 

But what of the other three criteria that were grouped 
together as being the more firmly established, i.e., the empirical, 
nomothetic, and operational? Paradoxically enough, 
it is one of these, the operational criterion, which seems to offer 
most difficulty when put to the pragmatic test. In fact, its 
acceptance raises so many questions that we will need to 
consider them at some length. 

At the outset, our suspicions are naturally aroused that 
all is not well, by the fact that two of our most articulate 
apostles of operationism—S. S. Stevens (10) and J. R. Kantor 
(5)—should arrive, in recent published articles, at such 
diametrically opposed conclusions, as a result of allegedly 
applying the same operational criterion to their respective 
concepts. To misquote the immortal bard, Omar Khayyam, 
“Each one comes out by that same door wherein he went.” 
For Kantor comes out with a strict behavioral objectivism; 
while Stevens, on the contrary, emerges with a water-tight 
justification, on operational grounds, of such ultra-subjective 
categories as experience, sensation, and sensory attribute. 

I believe that this divergence of views is significant, even 
though I cannot agree with Kantor that there is any necessary 
dualism implied in Stevens’ position. The point is that the 
concept ‘operational’ will have to be a wise doctor, and take 
its own medicine. It must be submitted to the same criterion 
to which it proposes to submit all other concepts, namely, 
that of discovering the operations in terms of which it is 
defined. And when we do this, what do we find? We find, 
for example, that Kantor defines operational as meaning 
interbehavioral. In the light of his brand of realism, this 
means direct and immediate responses to objects and events, 
as contrasted with responses to secondary constructs. Hence, 
he would say, visual sensations, as responses to light waves, 
could never be operationally defined, because light waves are 
secondary constructs of the physicist, not real objects or 
events. And, on the other hand, Stevens defines operations 
in terms of the very operation of defining itself, which is the 
most fundamental operation of them all. Hence, for him, 
discrimination, whenever and wherever it occurs, is an opera- 
tion capable of yielding scientifically sound concepts. In- 
trospection is a discriminating operation and therefore can be 
used in the definition of concepts. 

This seems to bring us to an impasse. How are we to 
settle the question of what is and what is not a legitimate 
defining operation? Bridgman fails to answer the question, 
except to suggest that for physics the operations should be 
physical in nature, and that, in general, for any science, the 
operations used in defining its concepts should be operations 
which are fundamental for that particular science. What 
operations are fundamental for psychology? I find myself, 
at this point, in sympathy with Kantor’s dictum, that funda- 
mental defining operations must be of the experimental sort; 
that is to say, they must be made upon real concrete phe- 
nomena, not upon secondary constructs. But I cannot agree 
with his ideas as to what constitutes reality or concreteness. 
I do not share his opinion that the act of discriminating can- 
not be a real experimental operation which is capable of 
yielding valid operational concepts. Therefore I believe that 
mentalistic concepts, like percept, image, idea, can be opera- 
tionally defined concepts. If necessary, they could be trans- 
lated either into brain terms or behavior terms, but would be 
no whit more operational as a result, because the brain or 
behavioral equivalents are already implied, if the original 
experimental operations were carried out, or could conceivably 
be carried out. This added proviso is important because it 
paves the way for those valuable intuitive flights of scientists 
which, while not themselves based on actually performed 
experimental operations, nevertheless do not carry them beyond
the limits of possible operational checks. It is these
borderline constructs that are frequently the most vigorous 
and fruitful for the science. 

A second source of ambiguity in discussing the operational 
criterion arises from the fact that you cannot necessarily tell 
by the name given to a concept whether it is or is not opera- 
tional. We do not name our concepts in terms of the experi- 
mental steps by which we validate them. Therefore the 
same concepts, by name, may be strictly operational as used by 
one person, and quite the opposite as used by another. I can 
scarcely think of a single concept in psychology that has not 
suffered both of these fates. In fact, every concept goes 
through a gradual metamorphosis. Some, like the concept 
‘conditioned reflex’ or ‘transfer’ or ‘I.Q.,’ start out as opera- 
tional, but become reified by use until they are eventually 
endowed with indefinite potentiality, so that we fallaciously 
try to explain processes in terms of them. Others, like the 
concept ‘emotion’ which started out as a reified entity, have 
undergone a chastening at the hands of the experimentalists 
until they are now much more sharply defined. And occasion- 
ally this chastening process results in the almost complete dis- 
appearance of concepts, like instinct, whose only justification 
for existence is their venerability. 

While we are discussing this point, I wish to register a 
vigorous protest against a tendency which has been all too 
prevalent, especially in the teaching of psychology. It is 
the tendency to assume that there resides some magic property 
in words as such, whereby they have unlimited potentiality 
for good or evil. The result is that the education of many of 
our embryo psychologists consists largely in acquiring a lot of 
inhibitions against the use of particular words. I remember 
being severely castigated very early in my teaching career, 
for letting that word ‘mind’ slip into one of my lectures. The 
word was taboo. It was not mentioned in polite society. 
Now this tendency to avoid words, as words, is as definitely 
a violation of the operational ideal as the opposite tendency to 
reify concepts, because it doubtless springs from the same 
source. The very persons who insist on these taboos are
likewise guilty of reification of those concept names like 
‘reflex,’ or ‘behavior,’ or ‘response,’ which happen to be 
among the list of words which have been sprinkled with the 
holy water of their approval. It cannot be emphasized too 
strongly that the validity of a concept cannot be determined 
by the particular verbal garments that happen to conceal its 
nakedness from the public gaze. 

I have one more qualification to make with regard to the 
operational criterion. One of the ideals of scientific concept- 
makers is to reduce all concepts to a few fundamental ones, 
which are considered basic for the science, whether or not 
they are looked upon as ultimate. This procedure is thought 
essential in setting up an integrated system. Yet this is not, 
by any means, the likely outcome of operationally defined 
concepts. Just the opposite, in fact. For there is no uni- 
versal set of operations. There are many different sets, each 
having arisen out of a different experimental need or situation. 
For example, Spearman’s ‘G’ is a concept derived from second 
order mathematical operations. The concept, ‘conditioned 
response,’ is derived from first-order operations of quite a 
different type, and that of ‘Gestalt’ from still another sort. 
Any modern psychology text is an eloquent witness to what a 
jumble of loosely related and often incompatible concepts 
have resulted from our habit of relying on many diverse 
experimental approaches for their definition. If we finally 
do achieve a unified system of concepts for psychology it will 
not be by virtue of the operational criterion so much as 
through the pragmatic need for bringing order out of chaos, 
and for creating a common language out of the babel of 
tongues which may be heard at any one of our national 
association meetings. 

This completes our re-evaluation of the major criteria of 
psychology as science. Thus far, however, no attention has 
been given to the criterion which is by far the most important 
of them all, the empirical standard. I use the term empirical 
in its broadest meaning as equivalent to the phrase ‘knowledge 
based on experience,’ but I have especial reference, of course, 
to the experimentally derived sort of knowledge. It would
seem that there was little that needed to be said in defense of 
this principle of science. The idea that science is based upon 
experimental facts and the inductive method is supposedly so 
firmly established that the two words ‘science’ and ‘empirical’ 
seem practically synonymous. We have appropriate words to 
express our suspicion of the opposite methods of thought, 
such as ‘rationalism,’ ‘scholasticism,’ ‘dogmatism,’ and the 
like. The ground which was won by the efforts of Galileo, 
Bacon, Newton and others has been taken for granted so long 
that it seems fantastic even to suggest that that same ground 
is in danger of being lost. Yet I feel constrained to call at- 
tention to certain tendencies in current thought which, to say 
the least, are rather disturbing. Taken in conjunction with
the course of events in the world at large, particularly the new 
political ideologies, these tendencies become highly signifi- 
cant. Perhaps some excess of apprehensiveness can be fore- 
given on the part of those of us who have, in the last few 
years, witnessed the steps by which a preeminent midwestern 
university, that was dedicated, from its very beginning, to the 
advancement of the empirical ideal of science through experi- 
mental research, has been suddenly converted into a forum 
for the dissemination of mediaeval scholasticism. Perhaps 
we can be excused for some anxiety at hearing it so frequently 
reiterated that research laboratories have no legitimate place 
in universities because the main function of universities is to 
train students in dialectic. 

It is unfortunate that the majority of scientists are not 
yet awake to the menace of the broader movement of which 
this local upheaval is merely the spearhead. Those who look
upon it as a rather amusing interlude, a tempest in a teapot, 
a manifestation of the delayed adolescent bumptiousness of a 
small group of self-constituted intellectual dictators, should 
have their eyes opened to the widespread sympathetic hearing 
which this revolt against empiricism is receiving in extra- 
scientific academic circles. It is eagerly seized upon by the 
humanists, and those intellectual disciplines whose dominance 
has been threatened by the onward march of empirical science. 
It is given a sympathetic ear by those erstwhile overenthusiastic
lay supporters of science who have been disappointed
at its failure to furnish an immediate panacea for all the 
knotty problems of human life, and who forget that, whereas 
empirical science has had a scant hundred years of actual 
opportunity to justify itself, rationalism has already loafed 
through two thousand years of pampered sterility. And, 
sad to relate, the soil has been, in part, prepared for the 
counter-revolution by certain scientists themselves whose 
eagerness to give a finished appearance to the crude structures 
of their sciences has led them to indulge in premature, 
dogmatic and sweeping generalizations. But whatever the 
cause, the fact is undeniable that empirical science is again 
facing a struggle for existence, and the local turmoil is just the 
rumble that precedes the storm. 

Things that happen at our own doorstep have a way of 
forcing unpleasant truths upon us. Events that take place at 
a comfortable distance leave us cold. As a matter of fact we 
have all witnessed the flight to this country of the outstanding 
scientists, from certain other countries, who found their 
empirical ideal totally at odds with the blighting menace of 
complete intellectual dictatorship. But we have not been 
deeply stirred out of our own complacency because we as- 
sumed that the whole thing went hand in hand with political 
dictatorship, and that we, living in a democracy, were im- 
mune from such a menace. But we should not lose sight of 
the fact that the spirit of authoritarianism is a contagion that 
can infect every realm of thought, and that intellectual ab- 
solutism can flourish in a democracy. 

Therefore, I am moved to suggest, in all seriousness, the 
fantastic prophecy that we may even now be living in the
twilight years of empiricism, and that an era of intellectual 
dogmatism and authoritarianism may be just around the 
corner. Ideas which are looked upon as fantastically absurd 
by one generation are accepted in all seriousness by the next. 
We have illustrations of this every day. Just repeat an 
absurdity often enough and it becomes an accepted belief. 
An example might be drawn from modern art. Could the
artists who scoffed or chuckled at the ‘Nude Descending the
Staircase,’ when it first appeared, have imagined that, a few 
years later, everyone would be taking it seriously? Could 
they have predicted, then, that the ideas of fur-lined teacups or 
two square blocks of stone embracing each other would one 
day find their way into a serious metropolitan art exhibit? 

It is difficult to evaluate properly the symptomatic sig- 
nificance of straws in the wind at the time. Later, looking 
back, we can see that they pointed the course of events. 
There are straws in the wind, now, pointing to the eclipse of 
empiricism. Remarks like the following, emanating from 
highly respected sources, are too infrequently challenged: 
“If the experimental facts fail to support the theory, then so 
much the worse for the experimental facts.” 

Psychology as science should take up the challenge and 
play a part in counteracting this trend away from empiricism. 
We might well adopt the platform of ‘radical experimentalism’ 
as formulated by H. S. Jennings (4) in 1927. In discussing 
the attitude of those whose ideal of scientific method is 
mainly rationalistic and who think “that continued recourse to
experimentation is a mere device of feeble minds,” he said: “This
is hard on those of us who find that the only method
by which we can progress is by observation and experiment.
. . . I have heard a brilliant experimentalist say that the
moment he moves a few inches beyond his experimental 
results, he goes wrong, as his next experiment shows. He 
must stick to experiment or progress stops and error begins.” 

Psychology as science cannot afford to listen to the siren 
calls of neo-rationalists and neo-scholastics with their little 
logic-tight predetermined world. Our path lies in the op- 
posite direction. We cannot afford to forget the ideal of 
empirical science embodied in the words of Isaac Newton 
(9), “In this philosophy, propositions are based on phe- 
nomena, and laws are derived by induction.” 


A CRITICISM OF HULL’S GOAL GRADIENT
HYPOTHESIS ¹


BY JACK BUEL 
Wesleyan University 


Professor Hull’s theory enjoys a unique status among 
theoretical systems of psychology. Whereas most new 
theoretical approaches begin by criticising other theories, 
Hull’s starts from premises which have for a long time been 
subjected to lengthy criticism. Undoubtedly this fact has 
tended to give Professor Hull time to complete the outlines 
of his theory without the necessity of answering many criti- 
cisms specific to his point of view. But, now that the outline 
is clear, the evidence that has been accumulating under the 
stimulus of the theory makes a critical examination desirable. 

Professor Hull’s methodology begins with an analogy to 
Euclidian geometry, namely, that given a relatively few 
axioms many theorems may be deduced, the postulates or 
axioms being primary principles, the theorems secondary 
principles. The concern of this paper is with those features 
of the axioms and theorems related to the goal gradient 
hypothesis, though the axioms themselves might be a good 
starting point for a critical examination of the whole system. 
It may be pointed out that definitions which contain expres- 
sions such as ‘stimulus complex,’ ‘behavior segment,’ or 
which define ‘correct’ or ‘right’ reactions as those behavior 
sequences which result in reenforcement are less definitions 
than programs for research. Hull’s axioms contain concepts 
that have been for years, and still are, the basis for experi- 
mental research. It is probably desirable not to expand this 
criticism here. The point is merely stated to indicate that 
the concentration on one aspect of the theoretical position 
does not mean that there are not other domains of a contro- 
versial nature. 

1 This paper, with some alterations, was presented at the meetings of the Psychological
Round Table, December 4-5, 1937, Hartford, Connecticut.

The goal gradient mechanism, as Professor Hull has formulated
it (16, 17, 18), is hypothetically established for
mazes through the process of an excitatory fragment of the 
goal response occurring earlier and earlier in the behavior 
sequence antecedent to the attainment of the goal at subse- 
quent trials. This excitatory response with the goal drive as 
its persisting stimulus decreases with distance from the goal, 
the effect being to reenforce differentially correct responses 
and to inhibit incorrect responses, the degree of reenforce- 
ment or inhibition being determined by the spatio-temporal 
distance of the particular reaction from the goal. Since the 
discrimination of distance to the goal by the rat is subject to 
Weber’s law, it follows that the discrimination between a right 
and a wrong response near the goal is made more readily than 
one at some distance from the goal. Because of this fact 
more errors are made at the beginning of the maze than at the 
end of the maze. The chief experimental sources substan- 
tiating this hypothesis are the conditioned reflex experiments 
as typified by the work of Hilgard, and the experiments with 
the animal maze, especially the animal linear maze. 
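
The Weber's-law step in this argument can be made concrete with a small numerical sketch. It is illustrative only: the function name `weber_fraction`, the detour length of 2 units, and the distances are assumptions chosen for the example, not values taken from Hull's papers or from any maze data. The sketch assumes that the rat compares the path through a blind (longer by a fixed amount) with the true path, and that discriminability depends on the relative rather than the absolute difference.

```python
def weber_fraction(distance_to_goal, extra_length):
    """Relative difference between the blind path and the true path.

    Under Weber's law two magnitudes are discriminated according to their
    ratio, so a fixed detour is a larger fraction of a short remaining
    path than of a long one.
    """
    if distance_to_goal <= 0:
        raise ValueError("distance_to_goal must be positive")
    return extra_length / distance_to_goal


# Hypothetical seven-unit maze; unit 1 is farthest from the goal.
# The remaining true-path length shrinks by one arbitrary unit per choice point.
EXTRA_LENGTH = 2.0  # assumed added length from entering a blind
remaining = {unit: float(8 - unit) for unit in range(1, 8)}

fractions = {
    unit: weber_fraction(distance, EXTRA_LENGTH)
    for unit, distance in remaining.items()
}

for unit, fraction in fractions.items():
    print(f"unit {unit}: remaining distance {remaining[unit]:.0f}, "
          f"Weber fraction {fraction:.2f}")
```

Under these assumed values the fraction rises steadily from the first unit to the last, which is the sense in which the hypothesis predicts easier discrimination, and hence fewer errors, near the goal.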

In the animal linear maze a heavily weighted factor, the 
last correct choice, not the turn itself, determines to a large 
extent the magnitude of the errors into the blinds preceding 
it. In other words, all blinds pointing in the same direction 
as the last correct choice of the maze are entered more fre- 
quently than those pointing in a direction opposite to the 
last correct choice (6). The frequency of entrances increases 
into those sides of maze units in the direction of the last 
correct choice of the maze as the end of the maze is approached.
Those units in which the blinds point in an opposite
direction decrease in error frequency as the end of the
maze is approached. We therefore obtain two opposed error 
gradients, both of which are due to the single factor of antici- 
pating the last correct maze choice. It would seem from this 
situation that the magnitude of the errors at any particular 
unit is accidental to the direction of the blind with respect 
to the direction of the last correct maze choice. In other 
words, if any particular blind were in such relationship to the 
last correct choice of the maze that entrances into it were in
the same direction as the last correct choice, the magnitude 
of errors into that blind would be greater than the magnitude 
of errors into a blind which by chance of maze design hap- 
pened to be in an opposite direction. It would be possible, 
therefore, so to arrange blinds that several error gradients 
could be obtained. Some of these would be consistent with 
the demands of the goal gradient mechanism, and some of 
the resulting error gradients would be inconsistent with it.²
But, the goal gradient is supposed also to operate to increase 
the number of correct responses as the end of the maze is 
approached, as well as to decrease the total number of errors. 
Fortunately, it is possible in the linear maze to separate the 
gradient for correct responses from that for incorrect re- 
sponses. This is illustrated in the graph in which the ordinate 
represents the number of turns per day per rat per unit for a 
period of 194 days and the abscissa represents maze units, 
unit 1 being the first and unit 7 the last of the maze (see 
Fig. 1). Points are plotted in the graph for those units in
which a turn in the direction of the last correct choice would
lead the animal into the true path. These units are 1, 3, 5,
and 7, and the upper curve indicates the average number
of correct choices per unit per day per rat for these units.
The lower curve represents the data for units 2, 4, and 6.
Conversely, for these units a turn in the same direction as the
last correct choice would lead the animal into a blind. The
lower curve, then, shows the average number of errors per
day per rat per unit.

Fig. 1. Average number of turns in the same direction as the last correct choice
for each maze unit from day 31 to 225 inclusive.
The ordinate represents the average number of turns per day per rat for each unit,
the abscissa the maze units from starting box to the maze exit. The dotted lines
represent extrapolated portions of the lower curve. (From Buel.)

2 It is obvious that this is not a direct criticism of the goal gradient mechanism
since the gradients obtained may be due to the varied conditions of the experiment
rather than to unsystematic changes in the basic mechanism. However, any particular
error gradient may be the result of maze design.

Considering the lower curve we see that as the end of the 
maze is approached more and more turns are made in the 
same direction as the last correct choice. In other words the 
error score actually increases as the end of the maze is ap- 
proached. This increase can only be explained on the as- 
sumption that the expectancy of the last correct choice in- 
creases at a definite rate as the end of the maze is approached. 
Considering the upper curve we find that the rate of increase 
(i.e., slope of the upper curve) is almost identical with the
lower curve. If we assume that the expectancy of the last 
correct choice of the maze was working for these units in the 
very same way and with the same force as for units 2, 4, and 6 
(lower curve), we can see why these two curves should have 
the same slope, the changes in both being due to the same 
factor. However, if this factor were the only one acting on 
both curves, we should expect them to be superimposed. 
Actually as is apparent from the graph, these two curves 
while having the same slope run at different levels. This 
phenomenon of parallel curves could easily be explained if we 
would postulate a given constant force working throughout 
units 1, 3, 5, and 7 and not for 2, 4, and 6. With this set-up,
i.e., one variable force affecting both curves in the same way
and one constant force affecting only the upper curve, the
graph becomes intelligible. The variable force is the expectancy
of the last correct choice. The constant force
working on units 1, 3, 5, and 7 is due to the fact that a turn in
accordance with the expectancy force is also correct. This
would mean that the factor of correctness reenforces a given
response (which response might have first been induced by the
expectancy factor) equally throughout the maze. Correctness
then, does not at all obey the goal gradient hypothesis, but is 
a constant (6). 

Any other postulations as to the causes of these curves 
lead to the necessity of negative slopes in one of them. Since
experimentally this is not the case, we may assume that the 
above analysis has a reasonable validity. 
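
The two-factor reading of the curves can be sketched numerically. The following is a minimal illustration, not a reanalysis: the linear form of `expectancy`, its coefficients, the constant `CORRECTNESS`, and the contrast case are all hypothetical and are not read from Fig. 1. The sketch shows that a variable factor common to both curves plus a constant acting on units 1, 3, 5, and 7 alone yields parallel lines, whereas a correctness factor that itself grew toward the goal would steepen the upper curve.

```python
from statistics import mean


def least_squares_line(xs, ys):
    """Return (slope, intercept) of the ordinary least-squares line."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar


def expectancy(unit):
    """Assumed linear growth of the expectancy factor toward the maze exit."""
    return 0.10 + 0.05 * unit  # hypothetical coefficients


CORRECTNESS = 0.30  # hypothetical constant where the expected turn is also correct

correct_units = [1, 3, 5, 7]  # turn toward the last correct choice is correct
blind_units = [2, 4, 6]       # the same turn leads into a blind

upper = [expectancy(u) + CORRECTNESS for u in correct_units]
lower = [expectancy(u) for u in blind_units]

upper_slope, upper_intercept = least_squares_line(correct_units, upper)
lower_slope, lower_intercept = least_squares_line(blind_units, lower)

# Contrast case: correctness growing toward the goal, as a gradient would require.
graded = [expectancy(u) + 0.04 * u for u in correct_units]
graded_slope, _ = least_squares_line(correct_units, graded)

print(f"upper slope {upper_slope:.3f}, lower slope {lower_slope:.3f}")
print(f"vertical separation {upper_intercept - lower_intercept:.3f}")
print(f"upper slope if correctness were graded {graded_slope:.3f}")
```

With these assumed numbers the two fitted slopes coincide and the vertical separation equals the constant, while the graded contrast case gives a steeper upper line. Equal slopes at different levels are therefore what a constant correctness factor would produce.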

As to the nature of the response which is supposed in 
Hull’s theory to set up the gradient, a series of experiments 
by Buel and Ballachey (8) on the choice-point expectancy 
factor is enlightening. The goal gradient can be assumed to 
be established in linear mazes by the fact that the last correct 
turn of the maze is a response which occurs earlier and earlier 
in the behavior sequence. Superficially this is true. It is 
when we examine the nature of this end response in the maze 
that we are confronted with the possibility that it is the 
meaningful nature of the last correct choice rather than the 
bare existence of a turn response which influences the preced- 
ing choices. The experiments indicate that for the last cor- 
rect choice direction of the maze to be effective in determining 
the choices in the preceding units of a linear maze, the pre- 
ceding units must be structurally similar to the last unit of 
the maze. If, for example, the last unit of the maze was a 
T-unit while the preceding units were Y-units or open spaces 
which allowed the rat to express a right or left preference, the 
choices in these latter units were unaffected by a forced turn, 
error choice or free choice situation in the last maze unit. 
But if the last unit was a Y-unit structurally similar to a 
preceding Y-unit, the correct choice direction of the last 
unit was found to be effective in determining the choices in the 
preceding structurally similar unit. The maze in this 
latter case was so constructed that the food was centrally
located with reference to the long axis of the maze. In the 
case where the last unit was a T-unit, the food was necessarily 
at one side of the long axis of the maze for the forced turn and 
error choice groups, and on both sides for the free choice 
groups, but this difference did not at all affect the preceding 
choices. This experiment indicates that where the last correct
maze choice is not effective, neither is the food position
effective in determining the choices of rats in the preceding 
maze units. It would seem, if the above observations are 
correctly interpreted, that the goal gradient hypothesis insofar 
as it involves error scores will have to be modified. Only 
under certain conditions will the errors progressively decrease 
as the end of the maze is approached. They will increase or 
decrease as the various choice points are similar enough to 
the last one to release or fail to release expectations that such a 
choice will lead to food, or the correct path to food. The 
results would also tend to show that the last unit must involve 
a problem to be effective. It must be a choice of some sort. 
A turn alone varies the structure of the last unit so that it 
cannot be identical with the preceding choice points. If the 
last unit involves an error in the choice, thus emphasizing 
the problem characteristics of the choice (as compared with a 
free choice) a greater effect in the choices of the preceding 
similar choice-points results. It can be shown, within the 
range of experiments cited, that only when choice situations 
are similar does the influence of the last choice make itself 
felt upon preceding choices, and that while the response, 
such as a turn, might well be similar to turns in preceding 
units, only those turns involving a choice are effective.* 

It would seem from these findings that the basic concept of 
the goal gradient breaks down, since the excitatory fragment 
of the goal is a simple response of some sort or another which 
is not formulated in terms of its significance to the problem 
at hand of learning the maze, the true path, the correct asso- 
ciations to be made to a list of words, etc. It might be pointed 
out here that Hilgard’s experiments which show the tendency 
of the conditioned response to occur at earlier intervals upon 
successive reenforcements may be interpreted in much the 
same manner as similar processes have been described by 
Tolman, namely, in terms of the significance of the response 
to the whole stimulus situation (14). Tolman (26) has called 
such stimulus-response processes sign-Gestalts to take account
of the meaningfulness of the connections. It is almost needless
to cite the Russian reflexologists’ concept of biological
utility in the conditioned response to document the above
point of view. Hull disavows any attempt to complete
mechanical prediction, but the assumption that a meaningless
response which happens to be present at the goal or coexistent
with the consummation of the goal should influence responses
antecedent to it at some later trial seems to overlook the
wide significance of the Gestalt, and even the strictly stylized
experiments of our most behavioristic laboratories.

* This point of view has further verification in the series of experiments reported
by Yoshioka (31, 32). For the relationship of these experiments to reward and maze
choices, see Buel and Ballachey (9).

Let us turn now to spatial mazes. It is here that one 
may offer the most serious criticism, not only of Hull’s 
system, but of any system which assumes that only one or 
relatively few factors influence the weights and the interfunctions
between the specific maze factors which can cause, and have
been proven to cause, errors. At one time an attempt was
made to catalogue a partial list of such factors (7), and much 
to the cataloger’s amazement it was found that over ninety 
specific causes of errors could be set down without undue 
combing of the literature. There is no doubt that many 
more could be found by someone more diligent. Let us take
one such factor and examine how it operates to cause errors. 
The factor selected for purposes of illustration is centrifugal 
swing. This concept, or factor, was first investigated by 
Schneirla (24) in relation to the behavior of ants and later the 
concept was applied to rats (1, 3). When a rat is forced by 
the pattern of the alley in the maze to make a turn, he is 
forced, if running at any speed at all, to the outer wall of the 
alley beyond the turn so that upon reaching the next bifurca- 
tion he is nearer one entrance of the choice-point than the 
other. The weight of this factor in causing errors may be 
indicated by its effect in two twelve-unit mazes of different 
patterns. In the first trial, and previous to the rat’s experi- 
ence with mazes, 85.4 per cent of the total errors are attribu- 
table to centrifugal swing in one maze and 88.2 per cent in the 
other where chance is 41.7 per cent (9). 

This very heavily weighted cause of errors is independent 
of goal gradients, since it operates on the first trial. Perhaps, 
then, later trials will show some change in the rate at which
different blinds are being eliminated due to the intrusion of
the goal gradient. The corrected correlation coefficients expressing
the relationship between the error order of the blinds
for the first trial and the error order of blinds for trials 2-17 are
.89 for one maze and .85 for the other. The reliability coefficients
for the first trial-error orders are .96 and .97 respectively,
and for trials 2-17, .98 and .97 respectively (9). The
first thing to note about these correlation coefficients is that 
they indicate high specificity, and high degree of relationship 
between first trial-error factors and later trial-error factors. 
In terms of the degree of communality of function existing 
between the first trial-error order and later trial-error order, 
the degree of communality lies somewhere between 79 per 
cent and 89 per cent for the highest coefficient. Hence, 
between 11 per cent and 21 per cent of the error order is determined
by residual factors not common to first and later
trials. These percentages allow room for the operation of the 
goal gradient, but as a determiner of the relative order of 
difficulty of blinds when the factor of centrifugal swing ac- 
counts for some 88 per cent of the errors on the first trial, the 
goal gradient mechanism is evidently not as potent as it 
should be in Hull’s formulation. Further, note that centri- 
fugal swing is an almost mechanical determiner of errors. 
It is relatively independent of the psychological variables in 
maze learning. It depends for its operation upon the physical 
dimensions of the maze, the weight of the animal, and the 
speed at which he makes some specific turn in the maze. In 
other words, such a factor as this one would necessitate a 
new postulate in Hull’s system. But many such factors 
exist, if we may accept the evidence given by studies indicat- 
ing the reliabilities and relationships of first trial-error orders 
and later trial-error orders of blinds on diverse types and 
patterns of mazes. How useful would a learning theory 
be which contained some ninety special conditions for one 
type of problem? In human maze learning other kinds of 
factors determine errors. Some of them are no doubt as 
independent of the goal as centrifugal swing is for the rat in
the maze.
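
The arithmetic behind the communality figures can be checked with a short sketch. Two assumptions are made here and should be read as such: first, that the "corrected" coefficients were obtained by the usual correction for attenuation, r divided by the square root of the product of the two reliabilities (the text does not name the correction); second, that the stated range of communality runs from r squared up to r. The names `disattenuate` and `communality_bounds` are illustrative.

```python
import math


def disattenuate(observed_r, reliability_x, reliability_y):
    """Spearman's correction for attenuation: r_xy / sqrt(r_xx * r_yy)."""
    return observed_r / math.sqrt(reliability_x * reliability_y)


def communality_bounds(r):
    """Return (r squared, r): the two readings of shared determination."""
    return r * r, r


# Figures quoted in the text: corrected r, then the reliabilities of the
# first-trial error order and of the error order for trials 2-17.
MAZES = {
    "first maze": (0.89, 0.96, 0.98),
    "second maze": (0.85, 0.97, 0.97),
}

results = {}
for name, (corrected_r, rel_first, rel_later) in MAZES.items():
    # Raw coefficient implied IF the correction above was the one used (assumption).
    implied_raw = corrected_r * math.sqrt(rel_first * rel_later)
    low, high = communality_bounds(corrected_r)
    results[name] = {
        "implied_raw": implied_raw,
        "recovered": disattenuate(implied_raw, rel_first, rel_later),
        "communality": (low, high),
        "residual": (1.0 - high, 1.0 - low),
    }

for name, row in results.items():
    low, high = row["communality"]
    res_low, res_high = row["residual"]
    print(f"{name}: implied raw r = {row['implied_raw']:.3f}")
    print(f"  communality {low:.1%} to {high:.1%}, "
          f"residual {res_low:.1%} to {res_high:.1%}")
```

For the coefficient of .89 this gives a communality of about 79 to 89 per cent and a residual of about 11 to 21 per cent, in agreement with the figures stated above.
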

An experimental approach to determine the rôle of such
factors in animal mazes is illustrated in a study now under 
way. The fourteen unit maze pattern known as the Stone- 
Nyswander maze and used extensively at the University of 
California by Tolman and his students is being examined in 
terms of the error orders of first and later trials when the basic 
fourteen unit maze pattern has been placed in 22 unit maze
settings and in 18 unit maze settings, built in the form of alley mazes,
elevated mazes, mazes with tilting floors, and mazes with interchanged
units, and used with blind rats, anosmic rats, albino rats,
hooded rats, etc. (see Fig. 2).

Fig. 2. Diagrams of the mazes from which the correlations for the basic 14-unit
maze pattern were obtained.

The cross-hatched portions of the diagrams indicate the original Tolman-Honzik
basic 14-unit maze pattern. A represents Dr. E. Ghiselli’s 18-unit alley maze (normal
rats; G-18). B represents the original Tolman-Honzik alley maze which was used with
rats under different conditions of drive and reward (normal rats; HR, HNR, LHR,
LHNR). C represents Dr. J. A. Hamilton’s 22-unit elevated maze (blind rats; Ha-22).
The Honzik 14-unit elevated maze pattern which is not shown here has proportions
similar to the basic units of pattern C. These various mazes were set up and used in
quite different environmental situations and locations, and, of course, the handling,
food schedules, etc., varied with the different experimenters.

The Correlations Between the Error Order of Blinds for the
First Maze Trial and the Error Order of Blinds for Trials
2-11 for the Basic 14 Maze Units *

HR = Tolman-Honzik (27, 28), hungry rewarded group, 14 units, alley maze.
HNR = Tolman-Honzik (27, 28), hungry non-rewarded group, 14 units, alley maze.
LHR = Tolman-Honzik (27, 28), less hungry, rewarded group, 14 units, alley maze.
LHNR = Tolman-Honzik (27, 28), less hungry, non-rewarded group, 14 units, alley
maze.
G-18 = Ghiselli (rz), 18 unit alley maze, normal hungry rewarded rats.
Ha-22 = Hamilton (12), 22 unit elevated maze, blind hungry rewarded rats.
HFMN = Honzik (15), fixed maze units, elevated maze, normal rats.
HFMB = Honzik (15), fixed maze units, elevated maze, blind rats.
HCMB = Honzik (15), interchanged maze units, elevated maze, blind rats.

Since the reward in these
mazes is at different positions and distances from the basic
fourteen units in these various mazes one would expect, on the 
basis of the goal gradient mechanism, that the correlations 
between the error orders of the first maze trial and later 
maze trials would be markedly different. This is not the case. 
In the 14 unit alley maze with hungry food-rewarded rats 
the correlation between first trial-error orders and later trial- 
error orders of blinds is .83 (see Table I). When the same 
fourteen units are placed within a 22 unit maze setting, the 
correlation for these same basic fourteen units is .93. It does 
not seem clear how the goal gradient, if it operates at all, 
has maintained the same relative relationship between the 
first and later trial-error order in the two situations when 
the reward is in such dissimilar positions in space for the two 
settings. 
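
The comparison of error orders described above can be illustrated with a small computation. The following is only a sketch: it assumes that the coefficient is a rank-order (Spearman) correlation between the difficulty ranking of the blinds on the first trial and their ranking on later trials, which the text does not state, and the error counts are hypothetical values invented for illustration, not data from any of the studies cited.

```python
def ranks(values):
    """Return 1-based ranks for a list of scores, averaging ranks over ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over a run of tied scores.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson(x, y):
    """Product-moment correlation of two equal-length lists (non-constant)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def rank_order_correlation(first_trial_errors, later_trial_errors):
    """Spearman coefficient: product-moment correlation of the two rankings."""
    return pearson(ranks(first_trial_errors), ranks(later_trial_errors))


# Hypothetical error totals for 14 blinds (illustrative only).
first_trial = [9, 4, 7, 2, 8, 5, 1, 6, 3, 10, 12, 11, 14, 13]
later_trials = [40, 22, 31, 10, 35, 25, 12, 30, 15, 44, 50, 47, 61, 58]
print(round(rank_order_correlation(first_trial, later_trials), 2))
```

A coefficient near unity would mean that the blinds hardest on the first trial remain the hardest on later trials, which is the pattern the argument turns on.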


4 The writer wishes to express his indebtedness to Professor E. C. Tolman, and
Doctors C. H. Honzik, E. Ghiselli, and J. A. Hamilton for their generous permission
to use their data for this analysis.

5 The technique involved in obtaining the corrected correlation coefficients and
also the general method for obtaining the original coefficients is explained in Ballachey
and Buel (2, pp. 362-363).

In nine correlations between the error order of first trials
and later trials with the basic fourteen units in different 
settings, the animals under different conditions of drive, of 
sensory deprivation, etc., the median corrected correlation 
coefficient is .79, the range from .52 to .93. The reliability 
coefficients for the error order of blinds for the first trial are all 
above .91, the median coefficient being .95, while for later
trials the reliability coefficients are all above .97 with the 
median coefficient equal to .99. It seems most improbable 
that the goal gradient mechanism is working in this maze 
when the drive stimulus postulated as necessary for the con- 
tinuing excitatory stimulus is presumably absent from the 
non-rewarded groups and yet the correlations are around .79, 
or that when the reward is in dissimilar locations and distances 
from the crucial blinds that the correlations are relatively 
unchanged. 
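
The relation between an obtained coefficient, the two reliability coefficients, and a corrected coefficient can be sketched as follows. This is an assumption, not the author's stated procedure: the text refers the reader to Ballachey and Buel for the correction actually used, and the sketch simply supposes it to be the familiar correction for attenuation, in which the obtained correlation is divided by the square root of the product of the two reliabilities. The function name and the obtained value of .77 are hypothetical; only the reliabilities .95 and .99 are the medians quoted above.

```python
import math


def correct_for_attenuation(r_obtained, reliability_first, reliability_later):
    """Assumed correction: r_obtained / sqrt(reliability_first * reliability_later)."""
    for rel in (reliability_first, reliability_later):
        if not 0 < rel <= 1:
            raise ValueError("reliability coefficients must lie in (0, 1]")
    return r_obtained / math.sqrt(reliability_first * reliability_later)


# Hypothetical obtained coefficient; median reliabilities taken from the text.
corrected = correct_for_attenuation(0.77, 0.95, 0.99)
print(round(corrected, 3))
```

With reliabilities as high as those reported, a correction of this kind can change the obtained coefficient only slightly, so the conclusion would not depend heavily on which correction was applied.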

The fact that maze errors are determined to a large extent 
in the first maze trial, and hence, are to that degree inde- 
pendent of maze experience, and hence, of goal gradient, goal 
orientation and the like, is not confined to animal learning. 
Preliminary reports have been made by Brown and Buel (4) 
and Buel and Brown⁶ which indicate that the first trial
response patterns are highly determined in human tracing 
mazes. The relationships of these response patterns to later 
trials have not been determined, but preliminary experiments 
indicate that what was found for animal learning is likely 
to be the case with human learning. 

6 Paper presented before Section I of the annual meeting of the A.A.A.S.,
December 27, 1937.

Some time ago an attempt was made by Spence (25) to
determine the theoretical order of difficulty of blinds to be
expected in the Tolman-Honzik mazes if the sole determiners
of errors were the goal gradient mechanism and goal orientation.
The correlations he obtained between the theoretical
order and the obtained order of difficulty of blinds were .90
and .91. Presumably he used data from the hungry-rewarded
groups. Ballachey and Buel (2) computed the correlation
for the hungry non-rewarded group. This correlation
is .92 and definitely proves that the goal orientation of Spence
was not food orientation. A more crucial point is that the 
correlation between Spence’s theoretical order and the ob- 
tained order of difficulty of blinds for the very first maze trial 
is .65 for the hungry non-rewarded group and .66 for the 
hungry rewarded group. These correlations indicate that a 
number of specific factors unrelated to the goal gradient and 
goal orientation (because they are present in the first trial) 
determine the relative difficulty of blinds. Spence has raised 
objections to the conclusion of Ballachey and Buel that his 
coefficients are spuriously high. He defends the validity of 
his coefficients on the grounds that by chance the food- 
pointing blinds were heavily weighted from the start because 
of the design of the maze. But as indicated above, this same 
maze in 22 unit maze settings shows almost the same correla- 
tions between first and later trials as the fourteen unit maze by 
itself. It may be added that convincing proof is available that 
either goal orientation is a function resulting from maze design 
or that if it does exist as a factor causing errors its weight is 
relatively unimportant (2, 8, 9, 31, 32). 

That the correlation between the error
orders of the first maze trial and the error orders of later trials
is probably a general case is further indicated by Tryon,⁷ who made correlational analyses
of the error orders of adjacent trials in the 17 unit T-maze 
used in his study of the inheritance of learning ability in rats 
(29). The correlations between the error orders of adjacent 
trials are very high, approaching unity, except for the correlation
between the first and second trials. Tryon has evidence
to support the contention that the lower coefficient obtained 
between the first and second trial-error orders is in large part 
due to initial emotional adjustment. The point to emphasize 
in this connection is that not only may some new factors 
have been added to the second trial, but also that factors which 
were present in the first trial have dropped out (30). 

7 I wish to thank Professor Tryon for his kindness in allowing me to cite this as yet
unpublished finding.

Tolman and his students’ work on latent learning in
mazes (5, 13, 27, 28) has an important bearing upon the goal
gradient mechanism. In these studies, the rats were allowed
to wander through the maze without reward for a
number of trials. Then upon some trial at which a rewarded 
control group was well toward mastery of the problem, food 
was introduced at the end of the maze for the experimental 
group. The next trial after the rats had obtained the food 
shows a drop in the error curve to an even better score than 
that obtained for the control group (28). Not only does this
result tell against the goal gradient mechanism, but so does the fact
that the distribution of errors in this rewarded period is similar
to that for the control group. Further, the correlation between
the error order of the blinds for the non-rewarded period
and the rewarded period is .74, and the correlation between the 
error order of blinds for a group which was hungry and re- 
warded, and the error order of blinds at a later period when 
the reward was removed is .78. This correlation seems to 
point to specific factors determining errors which are inde- 
pendent of the reward, since they determine the relative 
magnitude of errors in blinds during the unrewarded period, 
and hence are independent of the goal gradient mechanism.⁸

So far we have discussed the single gradient from the goal. 
Dennis (10) some time ago published a paper indicating the 
presence of an entrance gradient in linear mazes. This latter 
finding might be used to explain the initial trial-error order of 
blinds except for the fact that its influence would be in a 
direction opposite to that of the goal gradient and hence would 
fail as a Hull type mechanism to explain the relatively high 
correlations between the first and later trial order of blind 
difficulty and is not compatible with the present formulation 
of an error gradient of negative slope. The most useful 
feature of the goal gradient concept is that it is a single 
variable, though Professor Hull by no means rules out the 
possibility of multiple goal gradients. But, if there are several 
gradients, each of which is interfunctional with respect to the 
other, the problem of determining error frequencies becomes
very complicated and the usefulness of the concept will vary
in inverse ratio to the number of gradients and their degree of
dependence upon one another.

8 The setting up of inhibition gradients from the empty food-box during the
unrewarded periods would seem to demand a reversal of the order of blind difficulty in the
maze. This result does not seem apparent in the experiments cited.

9 Dennis did not imply a gradient mechanism, but merely pointed to the fact that
an entrance gradient in terms of error scores is present in linear mazes.

Muenzinger, Dove and Bernstone (23) have recently re- 
ported a bi-directional goal gradient in an endless maze. 
The maze used was in the form of a square with food boxes at 
each corner. Five blinds and choice-points were placed in the 
paths between the corners. Essentially the procedure was to 
train a control group to go from one corner where no food 
had ever been present to food at the next corner. The experi- 
mental group ran from corner to corner, the food boxes in 
each corner being baited ahead of the rat’s arrival. The 
control group showed a non-reversing negative gradient for 
successive units. The experimental group shows an in- 
creasing volume of errors in successive units to the mid-point 
between food boxes, from whence the number of errors 
decreases as the goal is approached. It could perhaps be 
argued that the bi-directional gradient is really two gradients 
but they believe that they have obtained a single bi-directional 
gradient having positive and negative slopes. They ask, 
“Is Hull’s explanation of the uni-directional goal gradient 
(backward learning) applicable to the bi-directional goal 
gradient?” (23, p. 236). They assert that because Hull’s 
hypothesis rests upon goal-directed drive it is inadequate to 
account for the bi-directional gradient. They say, “If in the
goal-to-goal situation a goal-directed drive is present as the
persisting stimulus for maze responses and if the animal discriminates
distances according to Weber’s law then we should
get backward elimination of errors even in the first part of the
maze, a deduction contrary to our experimental results” (23,
p. 236). They also point out that if it is necessary to have one
hypothesis for backward elimination of errors and another for 
forward elimination of errors it would be a very ‘unsatisfactory 
state of affairs’ (23, p. 237). 

Tryon (30) has pointed to a number of maze gradients: 
“seven ‘direction-flow’ gradients which are spatial abstrac- 
tions elicited by the configurations in the maze pattern,” 
a food position gradient, an initial inertia gradient, and an
emotional adaptation gradient. 

No one denies the existence of gradients in maze learning.
They appear in the error and time scores of practically any 
learning problem, but their functions and slopes may be many 
and complicated. Their relationships to the essential prob- 
lem at hand of learning the true path of the maze, or the 
correct series of discrimination cards in a discrimination 
problem may not exist at all. Let us take, for example, the 
experiments of Krechevsky (19, 20, 21, 22) on ‘hypothesis’ 
learning in a discrimination apparatus with rats. The 
multiple discrimination boxes used in these experiments are 
similar to the linear maze set up, but more complex. It is 
this type of learning which may account for the lack of 
gradients for correct responses and, equally well, indicate 
how, if interfunction between the learning act and the 
error factors does exist, the relative weights of such factors are 
not disturbed. It is necessary to leave the weights of error- 
causing factors undisturbed because of the relatively high 
degree of relationship between the error order of blinds in the 
first trial in mazes and the error order of blinds in later trials. 
In the multiple discrimination situation, it will be recalled, the 
animal performs systematic sequences of behavior which are 
best described as attempted hypotheses for the solution of the 
problem presented to the rat. An hypothesis as to the nature 
and sequence of correct turns would function independently 
of the first trial tendencies to make errors. Such an hypothe- 
sis, if correct, is correct for the whole maze or some consider- 
able part of it. Inevitable tendencies to make errors as the 
result of the continuing first trial factors such as centrifugal 
swing, or some other such factor, would not necessarily dis- 
turb the hypothesis nor would the hypothesis necessarily 
disturb the independent error factor’s differential weights. 
Learning would consist in the hypotheses more nearly ap- 
proximating the true one as they varied from trial to trial, 
but along with this undifferentiating weighting of blinds, 
would go the highly differential error factors of the first trial. 
These latter, let us be reminded, contain some of the gradients, 
and as has been shown with the linear mazes, the error gradi- 
ents are also probably the result of some specific factor such as 
the last correct choice of the maze, or factors which the animal
brings to the maze. The animal learns to expect the last 
correct choice, but at the same time he is also learning the 
true path of the maze. Before he can make a perfect run, 
however, he must also learn to ignore or inhibit responses of 
the choice-point expectancy type. This fact, that two or 
more simultaneous learning problems are going on in the 
learning of the linear maze, may theoretically be carried to 
any spatial maze with the resulting hypothesis that many 
simultaneous learning problems are progressing together in 
maze learning, and that the learning of the true path may be 
learning of an entirely different nature from that which 
conditions animals to the specific expectancies, turns, gates, 
curtains, angles, etc., of the conventional maze. It does not 
seem desirable in this paper to enter into the substantiating 
experiments for this point of view, but it must be obvious to 
anyone who has run rats that the animal is confronted with 
many problems to solve in learning the maze, and that 
each problem intervening between the maze-start and the 
maze-goal may have its own intermediate gradients of errors.¹⁰
Some of these problems such as turns, gates, curtains, etc., 
may result in waste movements and excess time relative to 
the essential task of learning the true path of the maze. But 
these excesses of time and movement at particular locations 
in the maze may produce differential errors. In other words, 
differential speeds of locomotion as the result of an inter- 
mediate problem may produce differential errors into blinds 
(e.g., note the relationship of speed of locomotion to centrif- 
ugal swing as a determiner of errors). 

Let us now summarize the essential points of my criticism 
of the goal gradient mechanism as an explanatory principle 
in maze learning. 

1. In the linear maze we can obtain two opposed gradients 
of errors, both due to the last correct choice expectancy 
factor. The relative difficulty of blinds, where such a factor 
exists, is purely accidental to the direction of the blinds with 
respect to the last correct choice direction. 


10 In this connection, see Tolman (26) for the possible development of intermediate
goals in the maze.

2. Where, in the linear maze, error gradients can be
found, the correct choices of the maze fail to reveal any 
gradient whatsoever, which finding is opposed to the re- 
sult we should expect in the operation of the goal gradient 
mechanism. 

3. It is highly probable that the anticipatory response 
which establishes the gradient must be significant to the 
problem presented to the rat rather than that it should be a 
mere response contiguous in time or space with the consummation
of the goal.

4. Even in the maze running of non-hungry non-rewarded 
rats, where the persisting drive stimulus is presumably at a 
minimum, the same relative difficulty of blinds exists as for 
hungry rewarded rats. 

5. In spatial mazes, as opposed to linear mazes, the rela- 
tive difficulty of blinds is in large part set up in the very first 
maze trial, and hence is to that extent independent of goal 
gradients which must develop through maze experience. 

6. Many factors causing errors are by their very nature 
excluded from relationship with the goal, or the psychological 
functions associated with repeated experience of the goal. 

7. Changes in the location of the food, or the addition of
blinds or units to the beginning or end of a maze, do not seem
to disturb the relative order of difficulty of blinds to any large
extent.

8. The fact that error patterns are set up to a large extent 
in the first maze trial is not confined to animal learning and 
to specific maze patterns. 

9. In latent learning, it is impossible to see how the goal 
gradient mechanism is operating, so that upon the next trial 
following the introduction of reward, the error scores com- 
pare favorably with a control group and the relative difficulty 
of blinds is similar. 

10. There is evidence to support the contention of multiple 
gradients in mazes. The existence of multiple gradients 
would seem to make the goal gradient mechanism, if it exists 
as a determiner of responses in the sense that Hull has used it, 
only one, and perhaps a minor determiner, of responses within 
a complex of gradient mechanisms.

11. In maze learning there is the distinct possibility of
many simultaneous learning problems taking place at the 
same time. Each of these problems may have learning 
mechanisms that differ from one another, or at least have 
different intermediate gradients to complicate the problem 
of learning the true maze path. 

12. The type of learning which has operationally been 
named hypothesis behavior is seen to have no gradients at all, 
and yet does lead the animal to take the true path upon 
repeated experience with the problem. 

In conclusion, it should be stated that though the criticisms 
directed at the goal gradient hypothesis are all based upon 
experiment, in large part the specific nature of the criticism 
is the result of a particular interpretation. These interpreta- 
tions may be in serious error, but altogether they seem to 
raise doubts that the goal gradient is the sole mechanism 
involved in learning, or even a heavily weighted one. It
may be that through the introduction of multiple gradients, 
though certainly some of them cannot be thought of as goal 
gradients, an explanation of maze learning can be obtained 
within the fundamental concepts of Professor Hull’s theory. 
These criticisms are directed at the concept at its present 
level as it relates to maze learning. 

OPERATIONISM IN PSYCHOLOGY ¹


BY R. H. WATERS AND L. A. PENNINGTON 
University of Arkansas 


In 1927 Bridgman (3) argued that “one of the most noteworthy
movements in recent physics has been a change in
attitude toward what may be called the interpretative aspect
of physics.” In illustration of this change in attitude Bridgman
subjected the meaning of various physical concepts to a
rigid analysis. As a consequence of this analysis Bridgman 
showed that the customary method of defining concepts was 
inadequate. He therefore developed and defended the 
principle of operationism as the only satisfactory method for 
defining concepts in the field of physics. 

Contemporary psychology seems, in certain quarters at 
least, to be looking to the principle of operationism as a basis 
upon which its data may be interpreted, its warring factions 
reconciled, and future discord prevented. The advantages 
offered by the principle to psychology have been stressed by 
Boring (2), Stevens (12, 13), McGregor (8), McGeoch (7). 
Concrete applications of operationism have been made by 
Tolman (14), and by Seashore and Katz (11). These writers 
do not claim that operationism demands a new psychological 
school. Operationism is rather a methodological principle 
upon which all psychologists are urged to agree in order to 
achieve a solid and scientific interpretation of psychological 
data. The present paper is devoted to an examination of this 
principle in an effort to determine its validity for psychology. 
In spite of its apparent popularity we do not agree that 
operationism is the key to the final solution of psychological 
problems. Much less is it the shibboleth which will unite 
all psychologists. In short, it is our contention that operationism
as defined and applied by its proponents does not
and cannot furnish the framework and foundation for psychological
science.

1 Research Paper No. 601 Journal Series, University of Arkansas.

The notion of operationism has philosophical roots as 
Stevens (12) has pointed out, but its influence for psychology 
can be more directly traced to Bridgman’s pronouncement 
and to recent psychological papers. We can best begin our 
examination by presenting briefly Bridgman’s point of view. 
As developed by him the principle of operationism holds that 
no concept in physics means more than the set of operations 
involved in collecting the data upon which the concept is 
structured. Thus “the concept of length is fixed when the
operations by which length is measured are fixed: that is, 
the concept of length involves as much as and nothing more 
than the set of operations by which length is determined . . . 
in general, we mean by any concept nothing more than a set 
of operations; the concept is synonymous with the corresponding 
set of operations” (3, p. 5). Nowhere do we find in the writ- 
ings of those who defend this principle any quarrel with this 
statement.² Apparently, then, those who uphold operationism
in psychology have accepted the principle as thus
presented. The strictures of Bridgman’s interpretation should 
be borne in mind because of the all-too-apparent danger of un- 
consciously invoking something not included in the principle 
when applying it to concrete and specific cases. Opera- 
tionism, as this paper discusses it, will mean no more and no 
less than is contained in Bridgman’s statement. Our criti- 
cism of the principle will be based upon its psychological 
implications and will cover the following points: 1. The 
principle lacks novelty. 2. The principle does not guarantee 
unanimity in psychology. 3. The principle leads to multi- 
plication, not to reduction, of the number of concepts. 4. 
Operational definitions are broader in scope than those 
permitted by Bridgman’s statement. At the close of the 
paper we shall attempt to indicate the proper rôle of operational
techniques in psychology.

2 It should be noted that Seashore and Katz (11) have given a broader interpretation
to operationism than have the other writers mentioned. Had they all adopted
operationism in this broad sense, this paper would have been unnecessary.

First, is operationism in psychology something new and
different? The history of psychology shows that psychological
concepts have been derived from and have referred to
the particular experimental methods and conditions employed. 
This fact is illustrated by Watson’s revolt against the de- 
scriptive analysis of consciousness and his introduction of 
Behaviorism. Behavior, as a concept, appears from the 
literature to mean nothing more nor less than what the animal 
does under specified stimulation. Conclusions and analyses 
of studies done by the Behaviorists of the past and present 
decades show quite conclusively that these conclusions are 
couched in the terms of the experimental procedures utilized. 
A contemporary illustration of this procedure is found in 
Tolman’s paper, An Operational Analysis of ‘Demands’ (14). 
Tolman’s concept of ‘demands’ is rigidly defined in terms of 
specific experimental variables. The other contemporary 
schools of psychology, Functionalism, Structuralism, and 
Gestalt, though differing in systematic approach, furnish 
additional evidence that the general use of operationism as 
experimental methodology is not new. 

Unless there be some magic chemistry in the formulation 
of new terms operationism seems to us to be synonymous with 
methodism. Looked at in this way operationism turns out 
to be as old as the experimental method itself. The learning 
curve is drawn and described in terms of the methods used in 
collecting the data. The curve of retention is likewise de- 
termined by the method employed. Conditioning refers to 
a particular method of experimental procedure. The con- 
cepts learning, forgetting, retention, intelligence, motivation, 
individual differences, and personality, and others discussed 
by Carr (4) get their meaning from the methods and condi- 
tions of experimentation. 

To point out that there is nothing new in a proposal is, of 
course, a weak type of argument. There is something new in 
this proposition. Operationism differs in principle from 
customary experimental procedures in its insistence that a 
concept mean nothing more than, or be synonymous with, 
the laboratory methods necessary to demonstrate it. The 
proponents of operationism instruct us to establish harmony 
in psychology by defining our concepts only in terms of
particular sets of laboratory operations. Agreement be- 
tween psychologists, they argue, can be found only by so 
doing. It is this attitude which constitutes all that is new in 
operationism. 

The second question, then, would seem to be whether this 
novelty does guarantee unanimity in psychology. Will 
operationism’s insistence upon the nature of definitions be ac- 
cepted by psychologists? Will they agree that operationism 
does furnish the basis for unification and clarity? History 
answers these questions in the negative. The introduction of 
behavioristic methods has not unified psychology. In spite 
of its vigorous claims Gestalt psychology has not enrolled all 
psychologists under its banners. More particularly in spite of 
the development of better methods the controversies over the 
nature of learning, the relation of learning and conditioning, 
the nature of retention, of mind and mental activity, still 
rage. In view of these historical antecedents and since 
operationism means only a stress on generally practiced experimental
method, it is difficult to see how such a method can
act as a conciliatory principle among the various systems of 
psychology, and so result in unification. 

Specific illustrations of this conclusion are not difficult 
to find in the literature. Numerous investigators have ac- 
cepted sets of laboratory data and have built thereon differing 
theories or concepts. The experimental data accumulated by 
Lashley (6) and his students on the problem of brain struc- 
ture and function is a case in point. This group of workers 
has proposed the theory of equipotentiality to account for the 
results obtained under laboratory conditions. Hunter (5) 
and others, accepting these data, have disagreed with Lashley’s
theoretical stand and have supported an entirely different
interpretation. These variant concepts of brain function,
in this instance developed from common data, point to a 
characteristic of all research, namely, the rôle of theorizing and
generalization in science. These two divergent conceptions 
originated with the operationally given and they illustrate 
the impossibility of achieving unanimous agreement on that 
basis in psychology. 

A second striking example of rival interpretations of the
operationally accepted is found in Tolman’s (15) considera- 
tion of the acquisition of string-pulling behavior by rats. 
These animals under laboratory conditions learned to pull- 
in-can-for-food. This operational and laboratory definition 
of the behavior manifested serves only as a point of departure 
for Tolman. The major portion of his treatment of this 
stimulus-response relationship contains a discussion of the 
relative superiority of the sign-Gestalt theory of learning 
over the conditioned response theory. These and other 
illustrations indicate the impossibility of unanimity among 
psychologists, much less the reconciliation of systems by way 
of operationism. Proponents who expect this result are not 
likely to see it realized when concepts already operationally 
defined fail to bring general agreement. 

The third point which may be raised against operationism 
is the possible multiplication rather than the reduction of the 
number of concepts resulting from its general application. 
The fact that the concept, for Bridgman, is synonymous with a 
corresponding set of operations cannot be over-emphasized. 
The slightest change in any aspect of a set of operations would 
mean, therefore, a new concept and would demand, like- 
wise, a new symbol for its designation. A multiplicity of 
concepts could scarcely be avoided. It is clear to us that 
earlier arguments for the application of operationism in 
psychology reduce to an argument for the general acceptance 
of the experimental method. This reduction comes about as 
a result of applying Bridgman’s principle too loosely. The 
following illustrations of the strict application of Bridgman’s 
interpretation should clarify the point at issue. Let us start 
with the concept of length. Can we on the basis of opera- 
tionism develop the concept of length apart from the length 
of a particular object, or from a particular set of operations? 
Suppose we measure a table and get a certain result. We 
measure it again and get a slightly different value. Now do 
we have one length or two? Do we have one or two tables? 
Can a table have different lengths at the same time (as would 
happen if two people measured the table at the same time)? 
We cannot see how operationism, as defined by Bridgman, can 
give any other than positive answers to these questions. One 
set of operations yields one length. Another set of operations 
yields another length. If length is to be ‘synonymous with 
the corresponding set of operations’ then we must have two 
lengths for the same table, or two tables. With two concepts, 
given by two sets of operations, we must have two terms to 
denote them. 

A second illustration may be taken from the field of 
psychology. The phenomenon of color blindness is a case in 
point. We test a subject by means of the Holmgren Yarn 
Test. Then we use the Ishihara or any of the other color 
tests on the market. In each instance some degree of color 
defect is found. Since the operations in each case are differ- 
ent, we have, according to Bridgman, two concepts of color 
blindness and no basis for developing a unitary concept such 
as is assumed in visual theory today. Likewise, the unitary 
concept of after-imagery could not be developed on the basis 
of specific and concrete after-images of the laboratory. 
Any number of other illustrations will occur to the reader. 

All these illustrations lead inevitably to the conclusion that 
every set of operations demands a different concept and, 
consequently, a new term. Bridgman’s principle, strictly 
applied, could lead only to an increase, never to a decrease, 
in the number of concepts. Thus, it may be that the cause 
of the failure of operationism and of operational definitions 
to represent the key to the solution of scientific problems lies 
in the very nature of science itself. Scientific endeavor con- 
sists not alone in the amassing of detailed facts, but more 
importantly in systematizing and in organizing these facts 
to make a unified whole. The basis of this unification consists 
in the development of conceptual tools in the form of princi- 
ples and concepts on which basis the particular and concrete 
facts may be envisaged as special cases. Since operational 
definitions, by their very nature, can never take us beyond 
isolated and discrete facts, science by operationism cannot 
get from particular to general concepts. 

A fourth criticism made earlier by Russell (10) seems to be 
particularly pertinent for psychology. The point has to do 
with the meaning of experimental error. Russell’s argument, 
with which we would agree, is that for operationism this term 
is meaningless. Let us return to our earlier illustration of 
the table. We come out with two sets of measurements. 
On operational theory one set is as true and valid as the other, 
so both must be accepted. Of course, this does not actually 
happen in any particular case—neither set of operations is ac- 
cepted. Why? Obviously because we have learned that 
there is always a certain amount of error in our measurements. 
What we do is to strike an average and determine the amount 
of fluctuation in our measurements. The length of the table 
thus becomes sixty plus or minus .12 inches. Thus the mean- 
ing of ‘experimental error’ is simply that no single set of 
operations is accepted as valid—that the truth of the matter 
is not accurately revealed by any such set—but that each set 
approximates the truth. The rule is that the smaller the 
variation in results given by several sets of operations the 
nearer the scientist is to reality. But reality lies outside of 
and beyond the operations particular to any event. The 
history of the application of statistics to psychological ob- 
servations gives ample evidence of this fact. None of us 
dares publish experimental data unless we attach thereto a 
sign to indicate the probable experimental error. Thus, 
Russell’s conclusion that the term ‘experimental error’ would 
have no meaning in operationism is well founded. 
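
The averaging procedure described in this paragraph can be sketched in a few lines of Python. The sketch is illustrative only: the eight readings are hypothetical, and the use of the probable error of the mean (0.6745 times the standard error) is an assumption about the kind of 'plus or minus' figure intended.

```python
import math
import statistics

# Hypothetical repeated measurements of one table, in inches (illustrative only).
measurements = [59.8, 60.1, 60.3, 59.9, 60.0, 59.7, 60.2, 60.0]


def summarize(values):
    """Return the mean and the probable error of the mean.

    The probable error is taken here as 0.6745 times the standard error,
    which is an assumption about the convention meant in the text.
    """
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample standard deviation
    probable_error = 0.6745 * sd / math.sqrt(len(values))
    return mean, probable_error


mean, pe = summarize(measurements)
print(f"length = {mean:.2f} plus or minus {pe:.2f} inches")
```

No single reading in the list is reported as the length; the reported value is a summary of all of them, which is the point of the argument.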

As a corollary to the above it follows that the meaning of 
any concept is not exhausted by the concrete physical opera- 
tions utilized in its demonstration. This conclusion is im- 
plied in the statement frequently made that operations can- 
not be defined by operations. Stevens, it is true, has urged 
that operations may be operationally defined as “the per- 
formance which we execute to make known a concept” 
(13). But this statement will scarcely satisfy the critic. 
It sounds very much as though we had a knowledge of the 
concept by some means other than operations and that now we 
were simply attempting such and such a performance as will 
reveal our meaning to another. How else do we know our 
performance is adequate? Operationism does not give us a 
basis for testing operations, neither does it offer us any means 
for testing their validity as we have shown above. Unless 
we have some knowledge of a concept other than that given 
by a specific and concrete set of operations we cannot tell 
whether the operations are adequate to the task at all. 
The only way we have of choosing between different sets of 
operations is by knowing something about the definition or 
meaning of the concept in terms of its properties, a procedure 
not allowed by operationism as we are discussing it. 

We may illustrate this argument with one or two examples. 
Stevens asserts that the concept of animal spirits was rejected 
or could have been rejected earlier had the operational concept 
of definition been utilized. We must look upon his argument 
with some amusement since at almost every point it goes 
beyond the assumptions in operationism. The real cause of 
the rejection of the notion of animal spirits, we argue, was not 
the operations by which they were studied but the 
fact that some other criterion—some other notion of the 
truth—was more consonant with the scientific temper of the 
times. 

A second contemporary example is to be found in the 
present controversy over extra-sensory perception. By per- 
forming certain operations Rhine (9) has obtained certain 
results. His method seems to yield results which indicate the 
reality of the concept of mental telepathy. Are we at one in 
accepting the notion? Are we not more nearly unanimous 
in rejecting it? Why? Not because of operationism but in 
spite of it. Give us a satisfactory theory of the modus 
operandi of mental telepathy and we will accept the verdict 
of his operations. Our inability to formulate a satisfactory 
theory is the chief cause for rejecting the notion in spite of the 
fact that operationism would dictate the opposite. It will 
hardly do to say that better methods will establish or disprove 
it. On what grounds could this argument be made? The 
fact that we can speak of better, or poorer, methods means 
that we have some notion of the concept over and beyond 
the operations themselves. This supports the general 
notion that concepts and their corresponding operations 
interpenetrate. 

In view of the preceding discussion the question may be 
raised as to the place of operationism in psychology. Our 
answer to this question is that operationism merely re- 
emphasizes the need for the application of the experimental 
method to psychological problems. But not all psychological 
problems are quantitative in nature, hence other than quanti- 
tative methods will have to be utilized. Thus Bridgman’s 
interpretation is far too narrow for general application to 
psychological concepts. If operationism, as a methodo- 
logical principle, is to be applied to psychology, we feel that 
mental operations must be made coordinate with physical 
operations. Boring’s quandary about the felt immediacy 
of temporal perception illustrates our point. In disposing of 
the problem he says “the trouble seems to be that what is 
introspectively obvious is not necessarily true” (2, p. 522). 
Now the truth of the matter seems to be that by introspective 
methods we experience a felt immediacy of temporal percep- 
tion and that by a different method, that of the physical or 
quantitative sort, we get no such immediacy. Our argument 
is that introspective methods are operational in character. 
By what right do we reject them in favor of other operational 
procedures? We will have to be satisfied with the finding 
that mental and physical, qualitative and quantitative, 
methods do not yield the same result. This fact, however, 
does not justify the rejection of one method in favor of 
another. This coordination of the physical and mental 
operations has been stressed by Benjamin (1) who argues that 
inference and generalization are valid operations. 

Operational definitions serve their best function when they 
point out the laboratory conditions and methods by way of 
which the concept may be illustrated. What instructor in 
psychology does not fall back on this method of explaining 
the meaning of concepts to his class? We all find that we are 
materially assisted in making clear to a group the concepts 
conditioned reflex, suggestion, retroactive inhibition, etc., 
if we describe the particular experimental means employed in 
the laboratory study of these phenomena. But every in- 
structor is keenly aware of the fact that the student’s ability 
to recite or duplicate these procedures does not guarantee his 
complete understanding of the meaning of the corresponding 
concepts. These concepts, in other words, embody a wealth 
of meaning not given in these methods. 

This interpretation and use of operational procedures 
can be useful in psychology. Even so it would not, we be- 
lieve, lead to unanimity among psychologists. Different 
psychologists will continue to use and defend different methods 
of experimentation. Likewise they will continue to develop 
and defend different theoretical conceptions derived from 
identical operational procedures. 


PSYCHOLOGICAL PHYSIOLOGY 
BY CARROLL C. PRATT 


Rutgers University 


The statement may be taken as almost axiomatic (mean- 
ing by axiomatic that it is not necessary or worth the trouble 
to justify the statement) that all explanatory principles in 
psychology must eventually be capable of formulation in the 
language of physiology. The day is long since past when 
psychological data were regarded as constituting an unique 
realm of observational material requiring an indigenous style 
of self-sufficient explanatory principles. All psychological 
events are functions of bodily process, and their explanation 
must accordingly fall back upon the language of that science 
which takes bodily process as its subject-matter. The day is 
still indefinitely far in the future, however, when physiology 
will be able to offer concepts to psychology which are adequate 
for more than a limited number of the large array of facts 
established by modern experimental psychology. 

In view of this state of affairs, psychologists are frequently 
tempted to make up their own physiological concepts, only 
to discover that in certain quarters their efforts are dubbed 
brain mythology rather than brain physiology. To ward off 
a sense of inferiority, some psychologists, even among the 
radical behaviorists, recommend a retreat, a return to the 
position advocated many years ago by Titchener, viz., that 
psychological observations are self-sufficient and require no 
physiological supports. To the majority of present-day 
psychologists, however, it must be obvious that such a 
position is hardly tenable. 

The slur implied by calling the physiology of some psy- 
chologists brain mythology is generally not warranted. 
All scientific concepts are constructions built up by the rules 
of logic from the uniformities found to exist among the initial 
data of observation. If the initial data are functions of 
unknown or as yet inaccessible events, as psychological data 
in many instances most certainly are, then the concepts 
based on them may be regarded logically as an indication of 
the formal properties of the unknown determinants. 

Formal properties are matters of inference, and may be 
contrasted, for purposes of linguistic convenience, with 
material properties, which are the results of direct observation. 
In psychology, direct observations have to do with colors, 
sounds, times and magnitudes of response, number of repeti- 
tions required to recite a list of syllables, number of errors 
made by an animal in a maze, the change in appearance of 
images after a lapse of time, and so on. These items are all 
material psychological properties. They constitute the in- 
itial data of observation, but they also serve as a springboard 
from which to make a scientific jump into the unknown. 

The uniformities among psychological data are presum- 
ably what they are because of their relation to physiological 
determinants. The physico-chemical nature of these de- 
terminants may be and at present usually is unknown; but 
their formal nature must be indicated by the particular kind 
of uniformities revealed by their correlative psychological 
terms. If, therefore, the linguistic summary of material 
uniformities and relationships is couched in, or implies the 
concepts of physiology, such an extension beyond bare state- 
ment of fact is wholly in keeping with the procedure followed 
by all sciences in the construction of their concepts. If such 
concepts-by-extension do not leap too far beyond the ma- 
terial properties upon which they are based, their scientific 
status can hardly be called into serious question, for they 
amount to little more than a restatement of the initial observa- 
tions in a different terminology, usually in the terminology 
of that science which is one step nearer to the basic construc- 
tions of physics. 

The validity of physiological concepts used by psycholo- 
gists has at least two obvious tests. (1) They must not be 
inconsistent with any known physiological fact. (2) Their 
meaning must be discovered in the process of tracing their 
logic back to the initial observations. Any meaning not 
thus discoverable is at least gratuitous; at worst, it is scientific 
nonsense. 

The complaint that such concepts are tautological, inas- 
much as they merely restate in different language what has 
already been observed as fact, is not warranted, unless one 
wishes thereby to condemn as futile the whole system of 
scientific explanation. All scientific explanation is circular, 
but not necessarily vicious. It consists of filling in gaps by 
making guesses suggested by the very nature of the gaps 
themselves. What the material properties of the missing 
items are may be completely unknown, but an attempt can 
always be made to deduce their formal properties from what 
is known. An example of such a procedure can be drawn 
from the work of the physiologists themselves. 

In 1906 Sherrington cited eleven characteristic differences 
between conduction in nerve-trunks and in reflex arcs, and 
then proceeded to locate these differences at the surfaces of 
separation at the nexus between neurone and neurone (4). 
These differences then became the properties of the so-called 
synapse. It should be noted, however, that the synapse 
was never observed, nor were any events at the nexus between 
neurone and neurone recorded. The observations consisted 
solely in noting the time-relations between application of 
stimuli and end-effects in muscle. These material properties 
were converted into formal properties by the simple expedient 
of assigning them to a fictional entity, the synapse. Nothing 
could be more circular, but apparently only psychologists 
are troubled by such circularity. “The fundamental im- 
portance of the synapse,” writes Lashley, “seems a logical 
conclusion, yet we must bear in mind when evaluating theories 
of learning that the properties of the synapse are still entirely 
hypothetical. If we deduce its properties from the facts of 
learning, we gain nothing by explaining learning in terms of 
these hypothetical properties” (2). 

Lashley is plainly bothered by the circularity involved 
in explaining learning by concepts that are constructed from 
the very facts that the concepts are supposed to account for. 
But how else can explanation progress? How else can any- 
thing be gained scientifically? There is nothing mystical 
about scientific explanation, except to those who believe that 
it can do more than summarize the relationships that obtain 
in a network of observed items. If some of the items are 
missing, or cannot be observed, an explanation which by 
inference includes these items is clearly an hypothesis. As 
soon as the items are observed and correlated with what is 
already known, the hypothesis is proved ‘correct.’ But 
whether an hypothesis is correct or incorrect, it is almost 
inevitably circular, for the explanatory principles contained 
within it are usually suppositions indicated by the very facts 
which the hypothesis is designed to illuminate. The synapse 
is no different in this respect from any other scientific concept. 
Another example, taken from astronomy, illustrates the 
circularity of scientific concepts even more clearly. 

Soon after the planet Uranus was discovered, it was found 
that its observed positions did not permit the calculation of an 
orbit which could be accounted for by the known motions of 
Jupiter and Saturn. It was at first assumed that the difficulty 
was caused by inaccurate observations, but several years of 
careful checking failed to eliminate the discrepancy between 
fact and theory. The Reverend T. J. Hussey, an English 
amateur astronomer, first made the suggestion that the un- 
accountable deviations in the orbit of Uranus must be due to 
the action of a still more distant planet. Professional 
astronomers eventually took up the suggestion and calcu- 
lated the sort of orbit an ultra-Uranian planet would have to 
describe to produce the observed positions of Uranus. For 
several years the calculated planet was regarded as a reason- 
able explanation of the motions of Uranus. Could circularity 
be more perfect? The formal properties of the guessed planet 
were derived from the material properties (i.e., the observed 
positions) of Uranus, and then these same formal properties 
were used to explain the observed positions of Uranus. 
Finally, the calculated planet was actually observed in the 
predicted positions. A material property, now called Nep- 
tune, confirmed the accuracy of the formal properties of an 
unnamed fictional concept. 

The motions of psychological events describe orbits which 
can not be satisfactorily accounted for without far greater 
knowledge of the nervous system than is at present available. 
Nevertheless these same psychological events, when subjected 
to the controls of scientific observation, must give hints as 
to the manner in which their underlying neural correlates 
operate. If these hints are formulated in accordance with 
the imagination allowed by the rules of scientific logic, there 
can be no possible objection to their use as the sort of physio- 
logical concepts that psychologists find it profitable to work 
with. A single example, taken from the field of psycho- 
physics, will be enough to illustrate what is meant. 

If a series of comparison-stimuli, symmetrically distributed 
about the standard, is presented to an observer for judgment, 
it turns out that if the interval between the standard and 
comparison is more than four seconds, a marked preponder- 
ance of judgments in the direction of increase (heavier, louder, 
brighter, as the case may be) will be given (3). This fact, 
as sheer fact, has been known ever since the earliest days of 
psychophysics, but it remained for Köhler to give it meaning 
by treating it as an indicator of the physiology of mnemonic 
traces (1). The trace is a formal concept. What its true 
physiology is, no one knows; but everyone would be justified 
in assuming that its physiological mechanisms must have 
characteristics which account, among other things, for the 
preponderance in psychophysics of judgments of increase over 
those of decrease. By extending the implications of psycho- 
physics, Köhler fertilized a field that had lain fallow for years. 
The facts were there, but not enough was done with them. 
Placed within a physiological context, they achieved more 
lively significance. It may be argued that facts in and of 
themselves should be enough. The history of science seems 
to show, however, that a little planned speculation is far more 
important than any amount of indiscriminate collecting of 
facts. 

Psychology has more facts than it can properly handle. 
Relief from such an embarrassment of riches may be secured, 
not by blindly collecting still more facts, but by forcing those 
already at hand into a framework of formal physiology. 
Psychology cannot afford to wait for physiology to catch up 
with it. Psychologists must construct, by means of the 
usual logic of science, their own psychological physiology. 


A NOTE ON GUILFORD’S GENERALIZED 
PSYCHOPHYSICAL LAW ¹ 


BY CARL IVER HOVLAND 
Department of Psychology, Institute of Human Relations, Yale University 


Guilford (3) has proposed that both Weber’s and Cattell’s 
(2) psychophysical laws be subsumed under a general power 
function of the form $\Delta R = KR^{n}$. For Weber’s law the n 
would be equal to 1, for Cattell’s law it would be .5. An 
attempt was made by him to apply the power assumption to 
a number of studies in the field of perception. 
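
A short worked reduction, offered only as a reading aid, shows how the two classical laws fall out of the general form and why a logarithmic plot is convenient for estimating n. The symbols are those of the text; the logarithmic rewriting is a standard algebraic step and is not spelled out in the note itself.

```latex
\begin{align*}
\Delta R &= K R^{n} \\
n = 1:\quad \Delta R &= K R \;\Rightarrow\; \frac{\Delta R}{R} = K
  && \text{(Weber's law)} \\
n = 0.5:\quad \Delta R &= K \sqrt{R}
  && \text{(Cattell's law)} \\
\log \Delta R &= \log K + n \log R
  && \text{(a straight line of slope } n \text{ in logarithmic coordinates)}
\end{align*}
```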

Data from a study of Robinson and Robinson (5) were 
employed. In this experiment the subjects were required to 
draw lines corresponding to various standard lengths which 
were presented to them. Forty lines were drawn for each 
of the ten standards. The standard lines ranged in length 
from six to thirty-three centimeters in steps of three centi- 
meters. From the data presented by the authors, Guilford 
computed the average length and standard deviations of the 
lines drawn in reproducing each standard length. For these 
data Woodworth’s (6) formula ² 

$$\Delta(R_1 + R_2) = \left[ (\Delta R_1)^2 + (\Delta R_2)^2 + 2r_{12}(\Delta R_1)(\Delta R_2) \right]^{1/2}$$

clearly fails to hold. Woodworth’s law is based upon the 
assumption that uniform correlations will obtain within the 
same kind of stimuli, but the data of Robinson and Robinson 
show that the greater the difference between the standard 
lengths used, the lower the correlation between the repro- 
duced lines. The discrepancies between the observed stand- 
ard deviations and those predicted from Woodworth’s 
equation were all positive and very large. This and other 
considerations led Guilford to conclude that the correlation 
between the measurements ($r_{12}$) in Woodworth’s formula was 
superfluous. As a substitute he proposed to fit a power func- 
tion of the form described above to the data. 

¹ The writer wishes to thank Professor J. P. Guilford for critical reading of the 
manuscript. He also wishes to acknowledge the assistance of R. L. Wanamaker, J. 
Miller, and C. J. Hill in the measurement and tabulation of the data. 

² After Guilford (p. 74). Woodworth’s original formula is 
$\Sigma^2 = \sigma_1^2 + \sigma_2^2 + 2r\sigma_1\sigma_2$, 
in which $\sigma_1$ and $\sigma_2$ are the variabilities of the components and $\Sigma$ that of the compound 
(p. 71). 

The data of Robinson and Robinson and the function 
derived by Guilford for these values are reproduced graphic- 
ally in Fig. 1. The constants used by Guilford for K and n 
are .576 and .588 respectively. It is apparent that from these 
data it is difficult to ascertain the true function. Marked 
deviations of the obtained points from the theoretical values 
are found. In fact, a straight line could equally convincingly 
be applied to the values. When $\Delta R = 5.989 + .0349R$ the 
correlation ³ between the theoretical and observed values is 
.987. This even exceeds the correlation of .970 between 
observed points and corresponding points on the curve ob- 
tained by Guilford with a power function. But it is of course 
true, as Guilford asserts, that there is a “stubborn theoretical 
demand that the curve shall pass through the origin.” ⁴ 
The discrepancies are explained by him on the basis of an 
‘end effect’: “It may be that the limited series of lengths 
used as standard stimuli produce a kind of closed system, and 
that variations in judgments are influenced by this system, 
and that the influence is greatest at the ends of the series.” ⁵ 
It also appears possible that the results were influenced by the 
fact that the paper strip upon which the standard lines were 
drawn was only forty-five centimeters wide and that cues 
may thus have been afforded with the longer lengths of lines. 

³ Here and elsewhere the correlation has been corrected for the number of points 
involved, cf. Ezekiel (1), p. 121. 

Guilford points out that to determine adequately whether 
or not such data may be satisfactorily fitted to a power func- 
tion “it still remains to find out what happens to the function 
with very high and with very low values of R.” ⁶ Data for 
such determinations were obtained by the present writer in 
connection with another problem. Lines as short as three 
centimeters and as long as seventy-two centimeters were 
employed. Six standard lengths were used in the computa- 
tions. Four other lengths were inserted to make the series 
appear continuous to the subjects. Forty determinations 
were made with the lengths of three, six, eighteen, thirty-three, 
sixty-three and seventy-two centimeters. The background 
for the black lines was uniformly white and was a field of 
thirty by one hundred and fifty centimeters to prevent any 
cues afforded by the distances to the end of the standards. 

MEANS AND STANDARD DEVIATIONS OF THE REPRODUCTION OF LINES. 
Forty Lines of Each Length Drawn by Each Subject 

                 Length of    Mean Length       Standard 
                 Standard     Reproduced (R)    Deviation (ΔR) 
                 cm.          cm.               cm. 

Group I            3.0           3.89             .67 
(40 Subjects)      6.0           7.55            1.05 
                  18.0          20.24            2.51 
                  33.0          35.58            3.72 
                  63.0          65.99            6.85 
                  72.0          76.04            7.32 

Group II           3.0           3.84             .67 
(40 Subjects)      6.0           7.69            1.13 
                  18.0          20.11            2.74 
                  33.0          35.37            4.29 
                  63.0          65.24            6.90 
                  72.0          75.51            7.52 

Combined           3.0           3.86             .67 
(80 Subjects)      6.0           7.62            1.12 
                  18.0          20.17            2.64 
                  33.0          35.47            4.04 
                  63.0          65.61            6.93 
                  72.0          75.78            7.46 

⁴ P. 79. 
⁵ Pp. 79-80. 
⁶ P. 85. 

Eighty college students were employed as subjects. To 
permit analysis of the consistency of the results, the subjects 
were divided into two groups of forty each. The average 
lengths of lines drawn, with the standard deviations of the 
reproductions, are given for these groups in the table. The 
data for the entire group of subjects are presented graphically 
in Fig. 2. The formula for the curve of best fit for these data 
is $\Delta R = 0.2179\,R^{.821}$. The goodness of fit is indicated by a 
correlation of .998 between the observed and theoretical 
values. The formulae for the lines of best fit for the separate 
groups of subjects are $\Delta R = 0.2104\,R^{.820}$ for Group I and 
$\Delta R = 0.2192\,R^{.826}$ for Group II respectively. This indicates 
considerable consistency in the data. From these results it 
appears that Guilford’s power function fits such data very 
adequately, not only in the central portions of the curve where 
the linearity permits Weber’s law to be a fair approximation, 
but also at the extremes, where Weber’s law is completely 
inadequate. 
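
The fitting of the power function can be sketched as follows. This is a minimal sketch, not the writer's documented procedure: it assumes the curve of best fit was obtained by least squares on the logarithms of R and ΔR, and it uses the combined-group values from the table. The function name `fit_power_law` is hypothetical. Under that assumption the constants come out close to those reported for the combined group.

```python
import math

# Combined-group values from the table (80 subjects), in centimeters.
mean_length = [3.86, 7.62, 20.17, 35.47, 65.61, 75.78]  # R
std_dev = [0.67, 1.12, 2.64, 4.04, 6.93, 7.46]          # delta R


def fit_power_law(r_values, dr_values):
    """Fit delta_R = K * R**n by least squares on the logarithms.

    Taking logs gives log(delta_R) = log(K) + n * log(R), a straight line
    whose slope is n and whose intercept is log(K). Returns (K, n).
    """
    xs = [math.log(r) for r in r_values]
    ys = [math.log(d) for d in dr_values]
    x_bar = sum(xs) / len(xs)
    y_bar = sum(ys) / len(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    n = sxy / sxx
    k = math.exp(y_bar - n * x_bar)
    return k, n


k, n = fit_power_law(mean_length, std_dev)
print(f"delta R = {k:.4f} * R^{n:.3f}")
```

Substituting the Group I or Group II columns of the table for the two lists gives the corresponding separate fits.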
SUMMARY 


The data of Robinson and Robinson (5) used by Guilford 
(3) are unsatisfactory in testing the adequacy of the general- 
ized psychophysical function proposed by him, since within 
the narrow range of stimuli employed a straight line is shown 
to be as adequate or even more adequate than the power 
function employed. Data compiled by the writer with an 
improvement of the Robinson technique and with a much 
wider range of stimuli tend, however, strongly to support 
Guilford’s generalized law, the relationship being close be- 
tween theoretical and observed values not only in the central 
region where Weber’s law is fairly adequate, but also at the 
extremes where Weber’s law is so completely unsatisfactory. 


THE RELIABILITY VS. THE VALIDITY 
OF TEST SCORES 


BY HARVEY A. CARR 
The University of Chicago 


For purposes of discussion, the meaning of these two terms 
will be exemplified in terms of a concrete illustration. A 
test is devised to measure insurance salesmanship. A 
typical group of such salesmen is then selected. A sales 
record for an appropriate period of time is secured for each 
member of the group. The group is then subjected to the 
test, and a corresponding group of test scores is secured. 

The validity of a test is usually defined as the degree to 
which the test measures that which it purports, or is designed, 
to measure. Our hypothetical test was designed to measure 
salesmanship, and hence it is valid in so far as it accurately 
measures this ability. It is assumed that our sales records 
constitute a direct and accurate measure of individual differ- 
ences in sales ability, and hence our group of test scores are 
valid for this purpose in so far as they correlate with this 
criterion of salesmanship. Validity is sometimes defined 
as that which is measured by the correlation between the test 
and its criterion. 

The reliability of test scores is usually defined as the degree 
to which they can be reduplicated on a second administration 
of the test. Consequently the reliability of a test is measured 
in terms of the correlation of the scores for the two administra- 
tions. Reliability is often defined as that which is measured 
by the correlation between two administrations. Often a 
second administration is not feasible, and then one of two 
substitute methods is used. The first is the correlation be- 
tween the scores for two equivalent forms of the test, and the 
second is an inter-item correlation within a single administra- 
tion. However we shall continue to refer to reliability 
as denoting the correlation between the scores for two 
administrations. 

The meaning of these two terms, especially that of relia- 
bility, can be further analyzed. The concept of reliability 
was discussed in a previous paper.¹ The word reliability can 
be transposed into the phrase ‘the ability to rely upon.’ 
The problem can be stated in terms of either one of two 
equivalent questions: 1. How reliable are these test scores? 
2. To what extent can we rely upon these scores? The latter 
form of question has the advantage of rendering evident the 
fact that the reliability of these scores is a function of the 
use to which we put them. A set of scores in and of them- 
selves are neither reliable nor unreliable. These scores 
possess the attribute of reliability only when they are used for 
some purpose, and their degree of reliability must be evalu- 
ated in terms of that use. 

Our test procedures are so arranged that the obtained 
distribution of scores is to be used as a measure of individual 
differences in test ability—the ability to pass this test. The 
degree of reliability of these scores for this purpose is a func- 
tion of the accuracy of these measurements. The problem 
of the degree of reliability of these scores thus reduces to 
that of their degree of accuracy in measuring test ability. 

How accurately do these test scores measure test ability? 
The criterion of accuracy is a function of the meaning of the 
concept of ability. Ability is a predictive concept. An 
individual’s ability denotes not only what he has done, but 
also what he may be expected to do on the next day and the 
day thereafter. Under the conditions under which the test 
is given, the degree of accuracy with which the scores measure 
the relative test ability of the members of the group is a 
function of the degree to which they correlate with the scores 
for a second administration. The size of the correlation 
between two administrations is thus an index of the accuracy 
and the reliability of these measurements. 

The validity of the test scores is measured by their correlation 
with the sales records. What does validity denote 
in this concrete situation? A valid measure is a true measure, 
and to be true that measure must be accurate. These scores 
are valid measures in so far as they are accurate measures of 
sales ability. It is assumed that the sales records constitute 
a direct and accurate measure of individual differences in 
sales ability, and consequently the test scores can also be 
used as an accurate measure of sales ability in so far as they 
correlate with the criterion of that ability. The correlation 
of the test scores with the sales records is thus an index of the 
accuracy and validity of those measurements. 

1 H. A. Carr, The search for certainty, Psychol. Rev., 1937, 44, 274-296. 

In this analysis two problems are encountered. 1. How 
accurately do these test scores measure test ability? 2. How 
accurately do these test scores also measure sales ability? 
The two problems are alike in that they involve the same set 
of scores and the question of the accuracy of measurement. 
They differ only in respect to what is being measured and the 
criterion of accuracy. It may be noted that both problems 
have just been stated and differentiated without any recourse 
to the two terms of reliability and validity. 

According to the conventional test usage, the first of these 
problems has been denoted as one of reliability, and the second 
as one of validity. Reliability and validity have thus been 
used as differentiating or distinguishing terms. We may now 
note that this test usage of the two terms differs considerably 
from their popular usage. 

The reliability of a set of scores is a function of their use, 
and their degree of reliability must be evaluated in terms of 
that use. We may now note that this same set of scores is 
used for two purposes—as direct measures of test ability, and 
as indirect or substitute measures of sales ability—and that 
their reliability can be evaluated for either or both purposes. 
The term reliability is just as applicable to one problem as to 
the other. The scores are reliable measures of test ability in 
so far as they correlate with the criterion of that ability, viz., 
the scores on a second administration, and they are also 
reliable measures of sales ability in so far as they correlate 
with the criterion of sales records. These scores may be 
highly reliable for one purpose, and possess but little or no 
reliability for the other. These scores are not either reliable 
or unreliable. There is not just one problem of reliability. 
There are as many problems of reliability as there are uses to 
which these scores are put. 

The term validity is also equally applicable to both 
problems. A valid measure is one that is accurate. These 
scores are valid measures of both test and sales ability in so 
far as they are accurate, and their degree of accuracy in each 
case is a function of the size of their correlation with the 
appropriate criterion. The validity of a set of scores is also 
a function of the purpose for which they are used, and they 
may possess a high degree of validity for one purpose, and 
but little validity for another. 

When applied to the field of measurement, reliability and 
validity are two highly synonymous terms that refer to 
accuracy of measurement from two slightly different points 
of view. It may be noted that they refer to accuracy of 
measurement as abstracted from what is being measured. 
In their popular meaning, neither term has the slightest 
reference to the object of measurement. In test practice, 
however, reliability and validity are used to distinguish 
between two problems of measurement that differ only in 
respect to what is being measured. The scores are reliable 
when they are used to measure test ability, and they are valid 
when they are used as measures of sales ability. Reliability 
and validity are thus two distinct and independent attributes 
of a set of test scores in virtue of the fact that each refers to a 
different object of measurement. Obviously the test usage 
of these terms differs considerably from that of their popular 
significance. 

Reliability and validity are quite abstract and impressive 
terms. The arbitrary use of such terms in ways that deviate 
widely from their popular meaning may be conducive to an 
abstract type of thinking as a means of avoiding any confu- 
sion in respect to their concrete significance. In the interest 
of clarity and precision of thinking, it is suggested that it 
would be better to distinguish between these two problems 
in terms of what is being measured, rather than to make the 
distinction by means of these two abstract terms. 





Attention may now be called to certain necessary relations 
obtaining between the so-called coefficients of reliability and 
validity. In a test situation of the sort under discussion, the 
size of the coefficient of reliability must at least equal that 
of the coefficient of validity. The coefficient of reliability 
may be much greater than that of validity, but it cannot be 
distinctly less provided that all of the computations involved 
are correct. It is thus possible to make certain legitimate 
inferences concerning the reliability of a test on the basis 
of its validity, and vice versa. The reliability of a test must 
be equal to or greater than its degree of validity, and its 
validity must be equal to or less than its degree of reliability. 

The truth of these propositions can be derived from an 
analysis of the significance of the reliability coefficient. 
Under the conditions of the test the obtained distribution 
of test scores is a function of chance factors and certain 
constitutional differences between the individuals. The correlation 
of this distribution of scores with that of a second 
administration is computed in order to differentiate between 
the relative influence of these two groups of factors. A zero 
or a small correlation indicates that the obtained distribution 
is primarily a function of the group of chance factors. Ob- 
viously a chance distribution of scores cannot exhibit a high 
degree of correlation with a group of sales records. At least 
the probabilities are distinctly against this hypothesis accord- 
ing to the accepted principles of statistical logic. A high 
degree of correlation with the sales records can thus be ex- 
plained only on the assumption that the distribution of test 
scores exhibits an equal degree of functional dependence 
upon the constitutional differences between the individuals. 
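
The reasoning of this paragraph can be illustrated, though not proved, by a small simulation. The sketch below assumes a simple additive model that is not stated in the text: each score is a weighted constitutional component (`ability`) plus an independent chance factor, and the sales record is the same ability plus its own chance factor. The function names, the normal distributions, the noise levels, and the group size are all assumptions chosen for illustration.

```python
import random
from math import sqrt

def pearson(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def simulate(constitutional_weight, n=5000, seed=0):
    """Return (reliability, validity) for one simulated group.

    Hypothetical model: score = weight * ability + chance factor.
    """
    rng = random.Random(seed)
    ability = [rng.gauss(0, 1) for _ in range(n)]

    def administer():
        # Each administration draws fresh chance factors.
        return [constitutional_weight * a + rng.gauss(0, 0.5) for a in ability]

    first = administer()
    second = administer()
    # Sales records depend on the same ability plus their own chance factors.
    sales = [a + rng.gauss(0, 1) for a in ability]
    return pearson(first, second), pearson(first, sales)

for label, weight in [("chance only", 0.0), ("constitutional + chance", 1.0)]:
    reliability, validity = simulate(weight)
    print(f"{label}: reliability={reliability:.2f}, validity={validity:.2f}")
```

Under these assumptions the chance-only scores correlate near zero both with the second administration and with the sales records, while scores carrying a constitutional component correlate substantially with both. The particular figures depend on the assumed noise levels and should not be read as a general result.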

In experiments such as this, it is test practice to compute 
and publish both correlations. The practice is so thoroughly 
established that some individuals seem to regard it as a 
categorical imperative. One sometimes gains the impression 
that the actions of these individuals are apparently based upon 
the implicit assumption that the reliability coefficient helps to 
establish the worth and significance of the validity coefficient, 
or that the reliability coefficient in some way validates the 
reliability of the validity coefficient.2 It is needless to say, 
however, that this implicit assumption is neither valid nor 
reliable. 

To what extent is this practice of computing and publish- 
ing both correlations necessary and justifiable? 1. Let us 
suppose that the experimenter first canvassed the various 
existent tests in order to select one to be used as the indirect 
measure of sales ability. The size of the reliability coefficient 
would be one factor determining such a choice. Tests of low 
reliability would necessarily be discarded, for their coefficients 
of validity can not be materially greater than their coefficients 
of reliability. 2. Let us assume that the test is selected, or a 
new one constructed, and that the experiment is performed. 
Why compute the reliability of the test on this group of sales- 
men and publish both correlations? There are two cases 
involved. a. Let us assume that the coefficient of reliability 
is equal to or less than the coefficient of validity. This fact 
merely serves as a check upon the correctness of the computa- 
tions involved. However, our computations of the validity 
coefficient—with which the experiment is primarily con- 
cerned—can be just as easily and readily checked by a process 
of recomputation. b. Let us assume that the reliability 
coefficient is much larger than that of validity. If the computations 
involved are correct, this fact merely gives us the 
additional knowledge that we are here dealing with two 
relatively independent abilities. 


[MS. received February 3, 1938] 


2 The latter form of statement has been used to illustrate the great variety of 
legitimate meanings of these terms. As another illustration, we may note that it is 
perfectly legitimate to speak of the reliability of a reliability coefficient. 















DR. BRITT ON THE CONCEPTS OF LEARNING, 
RETENTION, AND REPRODUCTION 


BY HULSEY CASON 


University of Wisconsin 


Dr. Britt1 has recently dissented from my description2 
of the concepts of learning, retention, and reproduction, and 
he states that we differ on at least ten fundamental ‘points of 
view.’ 

Reification.—Dr. Britt claims that I have used the terms 
learning and memory as if they were nouns corresponding to 
entities or real things, and that I am guilty of reification. 
The rarely used word ‘reify’ means ‘To treat as or make real; 
make concrete,’ and both Dr. Britt and I have used the words 
learning and memory as nouns and I think also as real. But 
Dr. Britt and I also use these terms to refer to processes, 
even if there are some psychological concepts such as retention 
and forgetting which cannot be so clearly regarded as 
processes. There is often an increase in clear thinking if 
certain psychological matters are regarded as particular kinds 
of psychological processes which occur in the body instead 
of being looked upon perhaps as ‘processes in general.’ 

Neural connections.—Dr. Britt objects to my reference to 
neural connections in the definition of learning, and I have 
recently3 attempted to answer this criticism by a somewhat 
analogous argument in which the differences between the 
concepts of a disease and the symptoms of a disease are 
described. 

Learning processes and learning processes.—In connection 
with the problem of non-contiguous causal influences, Dr. 
Britt raises the question, “If today for the first time in my 
life a dancing master demonstrates to me how one should 
stand, hold one’s partner, and move one’s feet, and I do not 
engage in the motor practice of any of these activities until 
tomorrow, are we not justified in saying that the learning 
process which began today is going on tomorrow?” It 
might be suggested that if some learning takes place on both 
of these occasions, the organic learning processes which occur 
in the body are probably in some ways different on the two 
occasions. 

1 S. H. Britt, The learning-remembering process—(A reply to Professor Cason), Psychol. Rev., 1937, 44, 462-469. 
2 H. Cason, The concepts of learning and memory, Psychol. Rev., 1937, 44, 54-61. 
3 H. Cason, Dr. Kellogg on the definition of learning, Psychol. Rev., 1938, 45, 101-105. 

Remote and retroactive influences.—It is claimed that the 
reason I have not included the concepts of remote and retro- 
active influences in the definition of learning is that I have 
followed the dictionary definition of ‘retroactive’ rather than 
the ‘as if’ concept which is the one used by psychologists. 
According to the dictionary, ‘retroactive’ means ‘having or 
designed to have a retrospective effect or reversed action,’ 
and Dr. Britt remarks that “The term implies only an as if 
validity, and continues to be employed only because it has 
been used so long in the literature and because its definition 
as a phenomenon does not imply an explanation.” I do not 
think that (a) the alleged ‘as if’ validity, (b) the period of 
time the word has been used, and (c) the absence of any refer- 
ence to explanation are arguments in favor of continuing the 
use of a term. Dr. Britt insists upon verbs, adverbs, and 
processes in another connection, but ‘retroactive’ is conceived 
as a ‘phenomenon,’ and Dr. Britt does not give a description 
of this ‘retroactive’ process. We are inclined to think that 
there would be a gain if the word ‘retroactive’ were com- 
pletely eliminated from the psychological vocabulary, and if 
wherever possible, one kept in mind what is known about the 
psychological processes that are actually taking place in the 
body. 

Dr. Britt’s concept of learning.—Dr. Britt objects to my 
separate treatments of the concepts of learning, retention, 
forgetting, and reproduction, but his definition of learning— 
which states that “Learning is: (1) the acquisition (2) by a 
living organism (3) of a mode of response (4) which is adapted 
(5) to a motivating (6) problematical situation”—divides 
learning into six parts, and after giving his definition, Dr. 
Britt devotes a separate paragraph to each of these six parts. 
Is it being claimed that there are no differences between the 
psychological concepts of learning, retention, forgetting, and 
reproduction? 

In regard to Dr. Britt’s claim that “Learning is (1) the 
acquisition (2) by a living organism .. . ,” can it in a strict 
sense be said that the subject himself is always the active 
agent in the learning processes which occur in his body? 

Dr. Britt gives only a brief argument in favor of his con- 
cept of learning, and he does not seem to make it clear just 
why the definition of learning should be made to depend upon 
(3) the presence of an overt motor response, (4) adaptation to 
the environment, (5) a ‘motivating’ factor, and (6) the exist- 
ence of a problem situation. Are there not processes which 
are generally regarded as learning processes in which each of 
these four factors is absent? 


[MS. received January 13, 1938] 
COLOR-MUSIC 


By Theodore F. Karwoski 
and Henry S. Odbert 
Dartmouth College 


A preliminary survey of 274 college students showed that 60 per cent 
of them associated color with music. Thirty of these, who showed 
the most complex synesthesia, were studied intensively over a long 
enough period to show a basic consistency in the association. The 
theoretical implications of the findings are discussed. 


To the exponents of color-music as an art form these findings will 
have a special significance. 